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DECLARATION UNDER 37 C.F.R. §1,132 

I, Dr. Marie-Lise GOUGEON, declare that: 

1. I am a Research Director in Pasteur Institute, head of the Antiviral 
Immunity, Biotherapy and Vaccine Unit. 

2. I have read and understood Application Serial No. 10/076,370, including 
pending claims 23-28 and 31-39, as well as withdrawn claims 40-50, copies of which 
are attached hereto, 

3. I am not an inventor or owner of Application Serial No. 10/076,370, or the 
subject matter contained therein. 

4. I understand that the Examiner has rejected claims 23-28 and 31-39 of 
this application as allegedly lacking written description support in the specification. 

5. I have been asked to comment on the disclosure of the genus of 
immunogenic polypeptide fragments comprising HIV-1 M al epitopes of 5-150 amino acid 
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residues, wherein at least one amino acid residue is substituted at one of the specific 
positions, in Application Serial No. 10/076,370 (the '370 application). 

6. Page 23 of the '370 application describes peptides, included in the 
invention, comprising or consisting of the conserved regions (which have the lengths of 
21, 43, 79, 94, and 131, respectively). Describing a peptide spanning positions 680-700 
necessarily describes a peptide with a length of 21 amino acid residues. 

7. The identity of the mutations encompassed by the disclosure in the '370 
application can be ascertained by considering Figure 3. Comparison of the various Env 
sequences in Figure 3 highlights positions where an amino acid is substituted in all of 
the sequences designated LAVmal, ARV2, and LAV E u when compared to LAVbru. 
Amino acid positions substituted in all of the sequences designated LAVmal, ARV2, and 
LAVeli when compared to LAVbru are called nonconserved amino acids. 

8. The fact that the nonconserved amino acids differ in all of the sequences 
designated LAVmal, ARV2, and LAVeli when compared to LAVbru suggests that each 
nonconserved amino acid is not required for immunogenicity. Viewing Figure 3, it is 
clear to me which amino acids are nonconserved. The claims include a listing of these 
nonconserved amino acids, which are targets for mutation. 

9. The '370 application states on page 23 that "[pjroteins containing or 
consisting of the 'well conserved stretches' are of particular interest for the production of 
immunogenic compositions and (preferably in relation to the stretches of the env 
protein) of vaccine compositions against the LAV-1 viruses." This statement implies 
that nonconserved amino acids, such as those provided in claim 23 as derived from 
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Figure 3, are not essential for immunogenicity of the peptides, the "well conserved 
stretches" being of particular interest for the production of immunogenic compositions. 
This was known for other antigenic systems (Tanabe et al. a J. Biochem 96:365 (1984), 
Lacal et al., Mol Cell Biol 6:1002 (1986)) at the time of filing of this application. Several 
recent studies support this statement for HIV, both at the cellular (Frahm et al., J. Virol 
78:2187 (2004) and humoral (Yang et aL, J. Virol. 78:4029 (2004)) level, and the use of 
consensus sequences in vaccine design to minimize the genetic differences between 
vaccine strains and contemporary isolates is currently being considered (Gaschen et al., 
Science 296:2354, 2355, second paragraph (2004); and Gallo, Lancet, early online 
publication, last paragraph of p.3 (September 28, 2005); Burton et al., Nature Immunol. 
5:233-236, 235 'strategies for immunogen design* paragraph (2004)) 

10-. Furthermore, claim 40 recites a structural requirement and a functional 
requirement for the claimed immunogenic peptides. Claim 40 recites an immunogenic, 
HiV-1 Env peptide of at least 21 amino acid residues of LAVma L Env sequences in 
Figures 3E-3F comprising at least one of the following conserved sequences: 

a) amino acid^residues 680-700; 

b) amino acid residues 488-530; 

c) amino acid residues 21 1-289; 

d) amino acid residues 37-130; and 

e) amino acid residues 490-620; 
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wherein said peptide binds to antibodies in AIDS patient sera; and wherein said 
antibodies are capable of binding to viral antigens encoded by the LAVmal molecular 
clone having C.N, CM. accession number 1-641. 

1 1 . The structural requirement of the claimed immunogenic peptides in claim 
40 is the at least one conserved sequence. 

12. The functional requirement of the peptides in claim 40 is that the peptides 
having the at least one conserved sequence are immunogenic. 

13. The specification identifies a correlation between these conserved 
sequences and this function of the claimed peptides. It states that the peptides 
containing the well conserved regions are of "particular interest for the production of 
immunogenic compositions and (preferably in relation to the stretches of the eny 
protein) of vaccine compositions against the LAV-1 viruses." See specification, page 
23. The specification suggested in 1986 that conserved regions of HIV proteins 
contribute to the induction of neutralizing antibodies to those proteins. This has been 
confirmed. See Holmbach et al., J. of Virology 67:1612-1619, 1617, second column, 
first and last paragraph (1993); Spenlehauer et al., J. of Virology 72:9855-9864, 9862, 
last paragraph of thefirst column (1998). 

14. I further declare that all statements made herein of my own knowledge are 
true and that all statements made on information and belief are believed to be true, and 
further, that these statements were made with the knowledge that willful false 
statements and the like so made are punishable by fine or imprisonment, or both, under 
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Section 1001 of Title 18 of the United States Code, and that such willful false 
statements may jeopardize the validity of the application or any patent issuing thereon. 



Dated: 




Dr.VM&rie-Lise Gougeon 
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POSITIONS AND AWARDS 

1975 : Roux Fundation Fellowship 
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Scientific commitees and editorial activity 

• President of the « European Cell Death Organisation » (ECDO) 1999-2001. 
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• Member of several scientific commitees in ANRS 

• Member of the scientific commitee « HIV and therapy », ABBOTT Company (1999-2005) 

• Member of the scientific commitee « Lipodystrophy » BMS Company (2000-2004) 

• Member of the scientific commitee ATC Biotherapy, INSERM (2002- ) 

• Associate Editor of Cell Death and Differentiation (1997-2003) 

• Member of the Editorial Board of Current Molecular Medicine (2003- ) 

• Member of the Editorial Board of AIDS (2001- ) 

CONSULTANT CONTRACTS 
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U.S. Patent Application Serial No. 10/076,370 
1-22. (Canceled) 

23. (Previously Presented) An immunogenic, HIV-1 Env peptide of 5-150 
amino acid residues of LAVmal Env sequences in Figures 3E-3F having at least one 
amino acid substitution at one or more of positions 8, 9, 90, 102, 131, 133, 140, 156, 
172, 177, 179, 185, 188, 192, 198, 207, 209, 290, 305, 308, 323, 333, 335, 337, 341, 
342, 353, 356, 359, 363, 404, 428, 440, 457, 41, 477, 483, 484, 486, 538, 555, 641, 
652, 656, 660, 663, 694, 740, 733, 799, 854, 856, 862, and 875 in relation to Env 
sequences of LAV B ru, LAV A rv2, and LAVeli; wherein said peptide binds to antibodies in 
AIDS patient sera; and wherein said antibodies are capable of binding to viral antigens 
encoded by the LAVmal molecular clone having C.N.C.M. accession number 1-641. 

24. (Previously Presented) The peptide of claim 23, wherein said peptide is 
generated by chemical cleavage. 

25. (Previously Presented) The peptide of claim 23, wherein said peptide is 
expressed from a recombinant DNA. 

26. (Previously Presented) The peptide of claim 23, wherein said peptide is 
generated by chemical synthesis. 

27. (Previously Presented) A method for detecting antibodies to HIV in a test 
sample comprising: 

a) providing at least one peptide of claim 23 affixed to a solid support; 

b) combining a test sample with the at least one peptide affixed to the solid 
support; 

c) optionally rinsing the solid support to remove unbound antibodies of the test 
sample; and 

d) detecting peptide-antibody complex formed, which is indicative of the 
presence of HIV antibodies in the test sample. 

28. (Previously Presented) A method of eliciting neutralizing antibodies to HIV 
in a mammal comprising: 

a) providing a composition comprising at least one peptide of claim 23, a 
suitable pharmaceutical^ or physiologically acceptable carrier, and optionally an 
adjuvant; 

b) immunizing the mammal with the composition; and 
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c) optionally testing a blood sample from the mammal to assay for the binding 
affinity and neutralizing activity of the elicited antibodies. 

29-30. (Canceled) 

31. (Previously Presented) The peptide of claim 23, wherein the peptide 
comprises at least one of the following conserved sequences: positions 37-130, 211- 

289, 488-530, 490-620, and 680-700 of Env as shown in Fig. 3E-F. 

32. (Previously Presented) A method for detecting antibodies to HIV in a test 
sample comprising: 

a) providing at least one peptide of claim 31 affixed to a solid support; 

b) combining a test sample with the at least one peptide affixed to the solid 
support; 

c) optionally rinsing the solid support to remove unbound antibodies of the test 
sample; and 

d) detecting peptide-antibody complex formed, which is indicative of the 
presence of HIV antibodies in the test sample. 

33. (Previously Presented) A method of eliciting neutralizing antibodies to HIV 
in a mammal comprising: 

a) preparing a vaccine comprising at least one peptide of claim 31, a suitable 
pharmaceutical^ or physiologically acceptable carrier, and optionally an adjuvant; 

b) immunizing the mammal with the vaccine; and 

c) optionally testing a blood sample from the mammal to assay for the binding 
affinity and neutralizing activity of the elicited antibodies. 

34. (Previously Presented) An immunogenic, HIV-1 Env peptide of at least 21 
amino acid residues of LAVmal Env sequences in Figures 3E-3F having at least one 
amino acid substitution consisting of an amino acid substitution at one or more of 
positions 8, 9, 90, 102, 131, 133, 140, 156, 172, 177, 179, 185, 188, 192, 198, 207, 209, 

290, 305, 308, 323, 333, 335, 337, 341, 342, 353, 356, 359, 363, 404, 428, 440, 457, 
41, 477, 483, 484, 486, 538, and 555, 641, 652, 656, 660, 663, 694, 740, 733, 799, 854, 
856, 862, 875; wherein said peptide binds to antibodies in AIDS patient sera; and 
wherein said antibodies are capable of binding to viral antigens encoded by the LAVmal 
molecular clone having C.N.C.M. accession number 1-641. 

35. (Previously Presented) The peptide of claim 34, wherein the peptide has 
21 amino acids. 
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36. (Previously Presented) The peptide of claim 34, wherein the peptide has 
43 amino acids. 

37. (Previously Presented) The peptide of claim 34, wherein the peptide has 
79 amino acids. 

38. (Previously Presented) The peptide of claim 34, wherein the peptide has 
94 amino acids. 

39. (Previously Presented) The peptide of claim 34, wherein the peptide has 
131 amino acids. 

40. (Withdrawn) An immunogenic, HIV-1 Env peptide of at least 21 amino 
acid residues of LAVmal Env sequences in Figures 3E-3F comprising at least one of the 
following conserved sequences: 

a) amino acid residues 680-700; 

b) amino acid residues 488-530; 

c) amino acid residues 21 1-289; 

d) amino acid residues 37-130; and 

e) amino acid residues 490-620; 

wherein said peptide binds to antibodies in AIDS patient sera; and wherein said 
antibodies are capable of binding to viral antigens encoded by the LAVmal molecular 
clone having C.N. CM. accession number 1-641. 

41 . (Withdrawn) A method for detecting antibodies to HIV in a test sample 
comprising: 

a) providing at least one isolated HIV-1 LAVmal Env peptide consisting of 5-150 
amino acid residues as set forth in Figures 3E-3F affixed to a solid support, wherein 
said peptide contains a LAVMAL-specific antigenic determinant; 

b) combining a test sample with the at least one peptide affixed to the solid 
support; 

c) optionally rinsing to remove unbound antibodies of the test sample; 

d) detecting peptide-antibody complex formed, which is indicative of the 
presence of HIV antibodies in the test sample. 

42. (Withdrawn) The method of claim 41 , wherein said peptide is generated 
by chemical cleavage. 
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43. (Withdrawn) The method of claim 41, wherein said peptide is expressed 
from a recombinant DNA. 

44. (Withdrawn) The method of claim 41 , wherein said peptide is generated 
by chemical synthesis. 

45. (Withdrawn) The method of claim 41 , wherein said peptide binds to 
antibodies in AIDS patient sera; and wherein said antibodies are capable of binding to 
viral antigens encoded by the LAVmal molecular clone having C.N.C.M. accession 
number 1-641. 

46. (Withdrawn) A method of eliciting neutralizing antibodies to HIV in a 
mammal comprising: 

a) providing a vaccine comprising the an isolated HIV-1 LAVmal Env peptide 
consisting of 5-150 amino acid residues as set forth in Figures 3E-3F, wherein said 
peptide contains a LAVMAL-specific antigenic determinant, at least one suitable buffer, 
and optionally an adjuvant; 

b) immunizing the mammal with the vaccine; 

c) optionally testing a blood sample from the mammal to assay for the affinity 
and activity of the antibodies. 

47. (Withdrawn) The method of claim 46, wherein said peptide is generated 
by chemical cleavage. 

48. (Withdrawn) The method of claim 46, wherein said peptide is expressed 
from a recombinant DNA. 

49. (Withdrawn) The method of claim 46, wherein said peptide is generated 
by chemical synthesis. 

50. (Withdrawn) The method of claim 46, wherein said peptide binds to 
antibodies in AIDS patient sera; and wherein said antibodies are capable of binding to 
viral antigens encoded by the LAVmal molecular clone having C.N.C.M. accession 
number 1-641. 
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BACKGROU ND OF THE INVENTION 

The present invention relates to a virus ca- 
pable of inducing lymphadenopathies (hereinafter "LAS") 
and acquired immuno-depressive syndromes (hereinafter 
"AIDS"), to antigens of this virus, particularly in a 
purified form, and to a process for producing these an- 
tigens, particularly antigens of the envelope of this 
virus. The invention also relates to polypeptides, 
whether glycosylated or not, produced by the virus and 

5 

to DNA sequences which code for such polypeptides. The 
invention further relates to cloned DNA sequences hybri- 
dizable to genomic RNA and DNA of the lymphadenopathy 
associated virus (hereinafter "LAV") of this invention 
and to processes for their preparation and their use. 
The invention still further relates to a stable probe 
including a DNA sequence which can be used for the 
detection of the LAV virus of this invention or related 
viruses or DNA, proviruses in any medium, particularly 
biological, and in samples containing any of them. 

An important genetic polymorphism has been re- 
cognized for the human retrovirus which is the cause of 
AIDS and other diseases like LAS, AIDS-related complex 
(hereinafter "ARC") and probably some encephalopathies 
(for review, see Weiss, 1984). Indeed all of the isola- 
tes, analyzed until now, have had distinct restriction 
maps, even those recovered at the same place and time 
[Benn et al . , 1985]. Identical restriction maps have 
only been observed for the first two isolates which were 
designated LAV [Alizon et al . , 1984] and human T-cell 
lymphotropic virus type 3 (hereinafter " HTLV-3 " ) [Hahn 
et al., 1984] and which appear to be exceptions. The 
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genetic polymorphism of the AIDS virus was better asses- 
sed after the determination of the complete nucleotide 
sequence of LAV [Wain-Hobson et al . , 1985], HTLV-3 
5 [Ratner et al., 1985 ; Muesing et al. , 1985] and a third 
isolate designated AIDS-associated retrovirus (herein- 
after " ARV2 " ) [Sanchez-Pescador et al., 1985]. in 
particular, it appeared that, besides the nucleic acid 
variations responsible for the restriction map polymor- 
1Q phism, isolates could differ significantly at the pro- 
tein level, especially in the envelope (up to 13 % of 
difference between ARV and LAV) , by both amino acids 
substitutions and reciprocal insertions-deletions 
[Rabson and Martin, 1985]. 
15 Nevertheless, such differences did not go so 

far as to destroy the immunological similarity of such 
isolates as evidenced by the capabilities of their 
similar proteins, (e.g., core proteins of similar 
nature, such as the p25 proteins, or similar envelope 
2q glycoproteins, such as the 110-120 kD glycoproteins) to 
immunologically cross-react. Accordingly, the proteins 
of any of said LAV viruses can be used for the in vitro 
detection of antibodies induced in vivo and present in 
biological fluids obtained from individuals infected 
with the other LAV variants. Therefore, these viruses 
are grouped together as a class of LAV viruses (herein- 
after "LAV-1 viruses"). 
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SUMMARY OF THE INVENTION 

In accordance with this invention, a new virus 
has been discovered that is responsible for diseases 
clinically related to AIDS and that can be classified as 
a LAV-1 virus but that differs genetically from known 
LAV-1 viruses to a much larger extent than the known 
LAV-1 viruses differ from each other. The new virus 
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is basically characterized by the cDNA sequence which is 
shown in Figures 7A to 71, and this new virus is 
hereinafter generally referred to as "LAV 

Also m accordance with this invention, 
variants of the new virus are provided. The RNAs of 
these variants and the related cDNAs derived from said 
RNAs are hybridizable to corresponding parts of the cDNA 
of LAV MAL- The DNA of the new virus also is provided, as 
well as DNA fragments derived therefrom hybridizable 
with the genomic RNA of LAV MAL , SU ch DNA and DNA 
fragments particularly consisting of the cDNA or cDNA 
fragments of LAV MAL or of recombinant DNAs containing 
such cDNA or cD.NA fragments. 

DNA recombinants containing the DNA or DNA 
fragments of LAV mL or its variants are also provided. 
It is of course understood that fragments which would 
include some deletions or mutations which would not 
substantially alter their capability of also hybridizing 
with the retroviral genome of LAV MftL are to be conside- 
red as forming obvious equivalents of the DNA or DNA 
fragments referred to hereinabove. 

Cloned probes are further provided which can 
be made starting from any DNA fragment according to the 
invention, as are recombinant DNAs containing such 
fragments, particularly any plasmids amplifiable in 
procaryotic or eucaryotic cells and carrying said 
fragments. Using cloned DNA containing a DNA fragment of 
LAV MAL as a mol ecular hybridization probe - either by 
marking with radionucleotides or with fluorescent 
reagents - LAV virion RNA may be detected directly, for 
example, in blood, body fluids and blood products (e.g., 
in antihemophilic factors such as Factor VIII concentra- 
tes). A suitable method for achieving such detection 
comprises immobilizing LAV MAL on a support (e.g., a ni- 
trocellulose filter), disrupting the virion and 



hybridizing with a labelled ( radiolabeled or "cold" 
fluorescent- or enzyme-labelled) probe. Such an approach 
has already been developed for Hepatitis B virus in pe- 
ripheral blood (according to Scotto J. et al. Hepatology 
( 1983) , 3, 379-384 ) . 

Probes according to the invention can also be 
used for rapid screening of genomic DNA derived from the 
tissue of patients with LAV related symptoms to see if 
the proviral DNA or RNA present in their tissues is 
related to LAV MAL • A method which can be used for such 
screening comprises the following steps : extraction of 
DNA from tissue, restriction enzyme cleavage of said 
DNA, electrophoresis of the fragments and Southern • 
blotting of genomic DNA from tissues and subsequent 
hybridization with labelled cloned LAV provival DNA. 
Hybridization in situ can also be used. Lymphatic fluids 
and tissues and other non- lymphatic tissues of humans, 
primates and other mammalian species can also be 
screened to see if other evolutionary related retrovi- 
ruses exist. The methods referred to hereinabove can be 
used, although hybridization and washings would be done 
under non-stringent conditions. 

The DNA according to the invention can be used 
also for achieving the expression of LAV viral antigens 
for diagnostic purposes, as well as for the production 
of a vaccine against LAV. Fragments of particular 
advantage in that respect will be discussed later. The 
methods which can be used are multifold : 

a) DNA can be transfected into mammalian cells 
with appropriate selection markers by a variety of tech- 
niques, such as calcium phosphate precipitation, 
polyethylene glycol, protoplast-fusion, etc. 

b) DNA fragments corresponding to genes can be 
cloned into expression vectors for E. coli , yeast or 
mammalian cells and the resultant proteins purified. 



c) The provival DNA can be "shot-gunned" 
(fragmented) into procaryotic expression vectors to 
generate fusion polypeptides. 

Recombinants, producing antigenically competent fusion 
proteins,, can be identified by simply . screening the 
recombinants with antibodies against ^AV^ antigens. 
Particular reference in this respect is made to those 
portions of the genome of LAV MAL which, in the figures, 
are shown to belong to open reading frames and which 
encode the products having the polypeptidic backbones 
shown. 

Different polypeptides which appear in figures 
7A to 71 are still further provided. Methods disclosed 
in European application 0 178 978 and in PCT application 
PCT/EP 85/00548, filed Oct . 1 8 , 1985, are applicable for 
the production of such peptides from LAV . in this 
regard, polypeptides are provided containing sequences 
in common with polypeptides comprising antigenic deter- 
minants included in the proteins encoded and expressed 
by the LAV MAL genome. Means are also provided for the 
detection of proteins of LAV^ L , particularly for the 
diagnosis of AIDS or pre-AIDS or, to the contrary, for 
the detection of antibodies against LAV MAL or its 
proteins, particularly in patients afflicted with AIDS 
or pre-AIDS or more generally in asymtomatic carriers 
and in blood-related products. Further provided are 
immunogenic polypeptides and more particularly 
protective polypeptides for use in the preparation of 
vaccine compositions against AIDS or related syndroms. 

Yet further provided are polypeptide fragments 
having lower molecular weights and having peptide 
sequences or fragments in common with those shown in 
figures 7A to 71. Fragments of smaller sizes can be 
obtained by resorting to known techniques, for instance, 
by cleaving the original larger polypeptide by enzymes 
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capable of cleaving it at specific sites. By way of 
examples may be mentioned the enzyme of Staphvlococcvus 
aureus V8 , a-chymotrypsine , "mouse sub-maxillary gland 
protease" marketed by the Boehringer company/ Vibrio 
alginolyticus chemovar iophaaus collagenase, which 
specifically recognizes the peptides Gly-Pro, Gly-Ala, 
etc. 

Other features of this invention will appear 
in the following disclosure of data obtained starting 
from LAV MAL r in relation to the drawings. 



BRIEF DESCRIPTION OF THE DRAWINGS 

- Figs. 1A and 1B provide comparative restriction maps 
of the genomas of LAV MAL as compared to LAV £LI (Appli- 
cants' related new LAV virus which is the subject of 

their copending application , filed herewith) and LAV 

B RU 

(a known LAV isolate deposited at the Collection 
Nationale des Cultures de Micro-organismes (hereinafter 
"CNCM") of the Pasteur Institute, Paris, France under 
No. 1-232 on July 15, 1983) ; 

Fig. 2 shows comparative maps setting forth the 
relative positions of . the open reading frames of the 
above genomas ; 

- Figs. 3A-3F (also designated generally hereinafter 
"fig. 3") indicate the relative correspondance between 
the proteins (or glycoproteins) encoded by the open 
reading frames, whereby amino acid residues of protein 
sequences of LAV MAL are in vertical alignment with 
corresponding amino acid residues (numbered) of 
corresponding or homologous proteins or glycoproteins of 

LAV BRU' aS Wel1 aS LAV ELI and ARV Z ' 

- Figs. 4A-4B (also designated generally hereinafter 

"fig. 4") provide tables quantitating the sequence 

divergence between homologous proteins of LAV , LAV_ 

oR U EL I 



and LAV mL ; 

- Fig. 5 shows diagrammatically the degree of divergence 
of the different virus envelope proteins ; 

- Figs. 6A and 6B ("Fig. 6" when consulted together) 
render apparent the direct repeats which appear in the 
proteins of the different AIDS virus isolates. 

- Figs. 7A-7I show the full nucleotidic sequences of 
LAV MAL • 

DETAILED DESCRIPTION OF THE INVENTION 

CTARACTERIXATICOT AUTO Km,BCOTAR OL/DWIHIG OF AN 
AFRICAN ISGMLAYE. 

The different AIDS virus isolates concerned 
are designated by three letters of the patients name, 
LAV BRU referrin S to the prototype AIDS virus isolated in 
1983 from a French homosexual patient with LAS and 
thought to have been infected in the USA in the prece- 
ding years [Barre-Sinoussi et al . , 1983]. LAV_ was 

MAL 

recovered in 1985 from a 7-year old boy from Zaire. 
Related LAV £L][ was recovered in 1983 from a 24-year old 
woman with AIDS from Zaire. Recovery and purification of 
the LAV MAL virus were performed according to the method 
disclosed in European Patent Application 84 401834/138 
667 filed on September 9, 1984. 

LAV MAL is indistin 9uishable from the previous- 
ly characterized isolates by its structural and biologi- 
cal properties in vitro . Virus metabolic labelling and 
immune precipitation by patient MAL sera, as well as 

reference sera, showed that the proteins of LAV W „ had 

MAL 

the same molecular weight (hereinafter M MW" ) as, and 
cross-reacted immunologically with those of, prototype 
AIDS virus (data not shown) of the LAV-1 class. 

Reference is again made to European Applica- 
tion 178 978 and International Application PCT/EP 
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85/00548 as concerns the purification, mapping and 
sequencing procedures used herein. See also the 
discussion under the headings "Experimental Procedures" 
and "Significance of the Figures" hereinafter. 

Primary restriction enzyme analysis of LAV 

MA L 

genome was done by southern blot with total DNA derived 

from acutely infected lymphocytes, using cloned LAV 

B RU 

complete genome as probe. Overall cross-hybridization 
was observed under stringent conditions, but the res- 
triction profile of the Zainan isolate was clearly 
different. Phage lambda clones carrying the complete 
viral genetic information were obtained and further 
characterized by restriction mapping and nucleotide 
sequence analysis. A clone (hereinafter "M-H11") was 
obtained by complete Hindlll restriction of DNA from 
LAV MAL" infected cells > taking advantage of the existence 
of a unique Hindlll site in the long terminal repeat 
(hereinafter " LTR " ) . M-H11 is thus probably derived from 
unintegrated viral DNA since that species was at least 
ten times more abundant than integraded provirus. 

Figure 1B gives a comparaison of the restric- 
tion maps derived from the nucleotide sequences of 

LAV ELI' LAV MAL and Prototype ^ BRU , as well as from 
three other Zairian isolates (hereinafter "Z1" , M Z2 M , 
and "Z3 M respectively) previously mapped for seven 
restriction enzymes [Benn et al . , 1985]. Despite this 
limited number, all of the profiles are clearly 
different (out of the 23 sites making up the map of 
LAV BRU' only seven are Present in all six maps presen- 
ted), confirming the genetic polymorphism of the AIDS 
virus. No obvious relationship is apparent between the 
five Zairian maps, and all of their common sites are 
also found in LAV . 

B Ru 

Comservaltioni off One <gemie1tic orgamizaltioini. 

The genetic organization of LAV M ._ as deduced 



10 



15 



20 



25 



30 



35 



9 

from the complete nucleotide sequences of its cloned 
genome is identical to that found in other isolates, 
i.e., 5 ' gag-pol-central region-env-F3 1 . Most noticeable 
is the conservation of the "central region" (fig. 2), 
located between the pol and env genes, which is composed 
of a series of overlapping open reading frames 
(hereinafter "orf") previously designated Q, R, S, T, 
and U in the ovine lentivirus visna [Sonigo et al . , 
1985]. The product of orf S (also designated "tat") is 
implicated in the transactivation of virus expression 
[Sodroski et al . , 1985 ; Arya et al . , 1985] ; the 
biological role of the product of orf Q (also designated 
"sor" or "orf A") is still unknown [Lee et al . , 1986 ; 
Kang et al., 1986]. Of the three other orfs, R, T, and 
U, only orf R is likely to be a seventh viral gene, for 
the following reasons : the exact conservation of its 
relative position with respect to Q and S (fig. 2), the 
ponstant presence of a possible splice acceptor and of a 
consensus AUG initiator codon, its similar codon usage 
with respect to viral genes, and finally the fact that 
the variation of its protein sequence within the dif- 
ferent isolates is comparable to that of gag, pol and Q 
( see fig . 4 ) . 

Also conserved are the sizes of the U3, R and 
U5 elements of the LTR (data not shown), the location 
and sequence of their regulatory elements such as TATA 
box and AATAAA polyadenylation signal, and their 
flanking sequences, i.e., primer binding site 

JVC 

(hereinafter "PBS") complementary to 3' end of tRNA 
and polypurine tract (hereinafter "PPT"). Most of the 
genetic variability within the LTR is located in the 5' 
half of U3 (which encodes a part of orf F) while the 3' 
end of U3 and R, which carry most of the cis-acting 
regulatory elements, promoter, enhancer and 

trans-activating factor receptor [Rosen et al . , 1985], 
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as well as the U5 element, are well-conserved. 

Overall, it clearly appears that this Zairian 
isolate, LAV MAL » is the same type of retrovirus as the 
previously sequenced isolates of American or European 
origin. 

Variability off 1tlne ^iiral proteiins . 

Despite their identical genetic organization, 
the LAV^j and LAV^ L shows substantial differences in 
the primary structure of their proteins. The amino acid 
sequences of LA V^, ^ and LAV MAL proteins are presented in 
figures 3A-3F, aligned with those of LAV n ___ and ARV 2. 
Their divergence was quantified as the percentage of 
amino acids substitutions in two-by-two alignments (Fig. 
4). The number of insertions and deletions that had to 
be introduced in each of these alignments has also been 
scored. 

Three general observations can be made. First, 

the protein sequences of the LAV^ _ and LAV^ T are more 

EL I MkL 

divergent from LhV m{J than are those of HTLV-3 and ARV 2 

(Fig. 4A) ; similar results are obtained if ARV 2 is 

taken as reference (not shown). The range of genetic 

polymorphism between isolates of the AIDS virus is 

considerably greater than previously observed. Second, 

our two sequences confirm that the envelope is more 

variable than the gag and pol genes. Here again, the 

relatively small difference observed between the env of 

LAV BRU and HTLV-3 appears as an exception. Third, the 

mutual divergence of the LAV CT T and LAV^ T (Fig. 4B) is 

EL I MA L 

comparable to that between LAV^y and either of them; as 
far as we can extrapolate from only three sequenced 
isolates from the USA and Europe and two (LAV_ ' and 

£* LI 

LAV MAL ) from Africa, this is indicative of a wider 
evolution of the AIDS virus in Africa. 

gag and pol : Their greater degree of conservation 

compared to the envelope is consistent with their 
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encoding important structural or enzymatic activities. 
Of the three mature gag proteins, the p25 which was the 
first recognized immunogenic protein of LAV [Barr6- 
Sinoussi et al . , 1983] is also the better conserved 
(fig. 3). In gag and pol , differences between isolates 
are principally due to point mutations, and only a small 
number of insertional or deletional events is observed. 
Among these, we must note the presence in the over- 
lapping part of gag and pol of LAV BRU of an insertion of 
12 amino acids (AA) which is encoded by the second copy 
of a 36 bp direct repeat present only in this isolate 
and in HTLV-3. This duplication was omitted because of a 
computing error in the published sequence of LAV oorT 

B RU 

(position 1712, Wain-Hobson et al . , 1985) but was indeed 
present in the HTLV-3 sequences [Ratner et al . , 1985 ; 
Muesing et al . , 1985]. 

env : Three segments can be distinguished in the enve- 
lope glycoprotein precursor [Allan et al . , 19 85 ; 
Montagnier et al . , 1985 ; DiMar zoVeronese et al . , 1985]. 
The first is the signal peptide (positions 1-33 in fig. 
3), and its sequence appears as variable ; the second 
segment (pos. 34-530) forms the outer membrane protein 
(hereinafter "OMP" or "gp110" ) and carries most of the 
genetic variations, and in particular almost all of the 
numerous reciprocal insertions and deletions ; the third 
segment (531-877) is separated from the OMP by a poten- 
tial cleavage site following a constant basic stretch 
(Arg-Glu-Lys-Arg) and forms the transmembrane protein 
(hereinafter " TMP " or M gp 41") responsible for the an- 
chorage of the envelope glycoprotein in the cellular 
membrane. A better conservation of the TMP than the OMP 
has also been observed between the different murine 
leukemia viruses (hereinafter " MLV w ) [Koch et al., 1983] 
and could be due to structural constraints. 

From the alignment of figure 3 and the 
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graphical representation of the envelope variability 
shown in figure 5, we clearly see the existence of 
conserved domains, with little or no genetic variation, 
and hypervariable domains, in which even the alignment 
of the different sequences is very difficult, because of 
the existence of a large number of mutations and of 
reciprocal insertions and deletions. We have not 
included the sequence of the envelope of the HTLV-3 
isolate since it so close to that of LAV ODrt (cf. fig. 

BR U 

4), even in the hypervariable domains, that it did not 
add anything to the analysis. While this graphical 
representation will be refined by more sequence data, 
the general profile is already apparent, with three 
hypervariable domains (Hyl, 2 and 3) all being located 
in the 0MP and separated by three well-conserved 
stretches (residues 37-130, 211-289, and 488-530 of fig. 
3 alignment) probably associated with important biolo- 
gical functions. 

In spite of the extreme genetic variability, 
the folding pattern of the envelope glycoprotein is 
probably constant. Indeed the position of virtually all 
of the cysteine residues is conserved within the diffe- 
rent isolates (fig. 3 and 5), and the only three varia- 
ble cysteines fall either in the signal peptide or in 
the very C-terminal part of the TMP. The hypervariable 
domains of the OMP are bounded by conserved cysteines, 
suggesting that they may represent loops attached to the 
common folding pattern. Also the calculated hydropathic 
profiles [Kyte and Doolittle, 1982] of the different en- 
velope proteins are remarkably conserved (not shown). 

About half of the potential N-glycosylation 
sites, Asn-X-Ser/Thr , found in the envelopes of the 
Zairian isolates map to the same positions in L AV^ 
(17/26 for LAV ai and 17/28 for LAV^^ ) , The other sites 
appear to fall within variable domains of env, 
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suggesting the existence of differences in the extent of 
envelope glycosylation between different isolates. 

Qthgr — viral protein? : of the three other identified 

viral proteins, the p27 encoded by orf F r 3' of env 
[Allan et al., 1985b] is the most variable (fig. 4). The 
proteins encoded by orfs Q and S of the central region 
are remarkable by their absence of insertions/deletions. 
Surprisingly, a high frequency of amino acids substitu- 
tions, comparable to that observed in env, is found for 
the product of orf S (trans-activating factor). On the 
other hand, the protein encoded by orf Q is no more 
variable than gag. Also noticeable is the lower 
variation of the proteins encoded by the central regions 

Of LAV T and LAV M _ T . 

EL I MAL 

With the availability of the complete nucleo- 
tide sequence from five independant isolates, some 
general features of the AIDS virus 1 genetic variability 
are now emerging. Firstly, its principal cause is point 
mutations which very often result in amino acid substi- 
tutions and which are more frequent in the 3 ' part of 
the genome (orf S, env and orf F) . Like all RNA viruses, 
the retroviruses are thought to be highly subject to 
mutations caused by errors of the RNA polymerases during 
their replication, since there is no proofreading, of 
this step [Holland et al., 1982 ; Steinhauer and 
Holland, 1986] . 

Another source of genetic diversity is 
insertions/deletions. From the figure 3 alignments, in- 
sertional events seem to be implicated in most of the 
cases, since otherwise deletions should have occurred in 
independant isolates at precisely the same locations. 
Furthermore, upon analyzing these insertions, we have 
observed that they most often represent one of the two 
copies of a direct repeat (fig. 6). Some are perfectly 
conserved like the 36 bp repeat in the gag-pol overlap 
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of LAV^y (fig. 6-a) ; others carry point mutations 
resulting in amino acid substitutions, and as a conse- 
quence, they are more difficult to observe, though 
clearly present, in the hypervariable domains of env 
(cf. fig. 6-g and -h) . As noted for point mutations, env 
gene and orf F also appear as more susceptible to that 
form of genetic variation than the rest of the genome. 
The degree of conservation of these repeats must be 
related to their date of occurrence in the analyzed 
sequences : the more degenerated, the more ancient. A 
very recent divergence of LhV BR{J and HTLV-3 is suggested 
by the extremely low number of mismatched AA between 
their homologous proteins. However, one of the LAV nDTT 

B RU 

repeats (located in the Hyl domain of env, fig. 6-f) is 
not present in HTLV-3, indicating that this generation 
of tandem repeats is a rapid source of genetic diver- 
sity. We have found no traces of such a phenomenon, even 
when comparing very closely related viruses, such as the 
Mason-Pfizer monkey virus (hereinafter "MPMV") [Sonigo 
et al. # 1986], and an immunosuppressive simian virus 
(hereinafter "SRV-1" ) [Power et al . , 1986]. Insertion or 
deletion of one copy of a direct repeat have been occa- 
sionally reported in mutant retroviruses [Shimotohno and 
Temin, 1981 ; Darlix, 1986], but the extent to which we 
observe this phenomenon is unprecedented. The molecular 
basis of these duplications is unclear, but could be the 
"copy-choice" phenomenon, resulting from the diploidy of 
the retroviral genome [Varmus and Swanstrom, 1984 ; 
Clark and Mak, 1983]. During the synthesis of the first- 
strand of the viral DNA, jumps are known to occur from 
one RNA molecule to another, especially when a break or 
a stable secondary structure is present on the template; 
an inaccurate re-initiation on the other RNA template 
could result in the generation (or the elimination) of a 
short direct repeat. 
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Genetic variability and subsequent antigenic 
modifications have often been developed by micro- 
organisms as a means for avoiding the host's immune 
response, either by modifying their epitopes during the 
course of the infection, as in trypanosomes [Borst and 
Cross, 1982], or by generating a large repertoire of 
antigens, as observed in influenza virus [Webster et 
al., 1982]. As the human AIDS virus is related to animal 
lentiviruses [Sonigo et al . , 1985 ; Chiu et al., 1985], 
its genetic variability could be a source of antigenic 
variation, as can be observed during the course of the 
infection by the ovine lentivirus visna [Scott et al . , 
1979 ; Clements et al., 1980] or by the equine infec- 
tious anemia virus (hereinafter "EIAV") [Montelaro et 
al., 1984]. However, a major discrepancy with these 
animal models is the extremely low, and possibly 
nonexistant, neutralizing activity of the sera of 
individuals infected by the AIDS virus, whether they are 
healthy carriers, displaying minor symptoms, or 
afflicted with AIDS [Weiss et al . , 1985 ; Clavel et al., 
1985]. Furthermore, even for the visna virus the exact 
role of antigenic variation in the pathogenesis is 
unclear [Thormar et al., 1983 ; Lutley et al., 1983]. We 
rather believe that genetic variation represents a 
general selective advantage for lentiviruses by allowing 
an adaptation to different environments, for example by 
modifying their tissue or host tropisms. In the particu- 
lar case of the AIDS virus, rapid genetic variations are 
tolerated, especially in the envelope. This could allow 
the virus to become adapted to different "micro-environ- 
ments" of the membrane of their principal target cells, 
namely the T4 lymphocytes . These "micro-environments 0 
could result from the immediate vicinity of the virus 
receptor to polymorphic surface proteins, differing 
either between individuals or between clones of 
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lymphocytes . 

Conserved domains in the AIDS virus envelop* 



Since the proteins of most of the isolates are 
antigenically cross-reactive, the genotypic differences 
do not seem to affect the sensitivity of actual diagnos- 
tic tests, based upon the detection of antibodies to the 
AIDS virus and using purified virions as antigens. They 
nevertheless have to be considered for the development 
of the "second-generation" tests, that are expected to 
be more specific, and will use smaller synthetic or 
genetically-engineered viral antigens. The identifi- 
cation of conserved domains in the highly immunogenic 
envelope glycoprotein and the core structural proteins 
(gag) is very important for these tests. The conserved 
stretch found at the end of the OMP and the beginning of 
the TMP (490-620, fig. 3) could be a good candidate, 
since a bacterial fusion protein containing this domain 
was well-detected by AIDS patients' sera [Chang et al . , 
1985]. 

The envelope, specifically the OMP, mediates 
the interaction between a retrovirus and its specific 
cellular receptor [DeLarco and Todaro, 1976 ; Robinson 
et al., 1980]. In the case of the AIDS virus, jln vitro 
binding assays have shown the interaction of the enve- 
lope glycoprotein gp110 with the T4 cellular surface 
antigen [McDougal et al . , 1986], already thought to be 
closely associated with the virus receptor [Klatzmann et 
al., 1984 ; Dagleish et al . , 1984]. Identification of 
the AIDS virus envelope domains that are responsible for 
this interaction (receptor-binding domains) appears to 
be fundamental for understanding of the host-viral 
interactions and for designing a protective vaccine, 
since an immune response against these epitopes could 
possibly elicit neutralizing antibodies. As the AIDS 
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virus receptor is at least partly formed of a constant 
structure, the T4 antigen, the binding site of the 
envelope is unlikely to be exclusively encoded by 
domains undergoing drastic genetic changes between 
isolates, even if these could be implicated in some kind 
of an "adaptation". One or several of the conserved 
domains of the OMP (residues 37-130, 211-289, and 
488-530 of fig. 3 alignment), brought together by the 
folding of the protein, must play a part in the virus- 
receptor interaction, and this can be explored with 
synthetic or genetically-engineered peptides derived 
from these domains, either by direct binding assays or 
indirectly by assaying the neutralizing activity of 
specific antibodies raised against them. 

African AIDS viruses 

Zaire and the neighbouring countries of 
Central Africa are considered as an area endemic with 
the AIDS virus infection, and the possibility that the 
virus has emerged in Africa has became a subject of 
intense controversy (see Norman, 1985). From the present 
study, it is clear that the genetic organization of 
Zairian isolates is the same as that of american 
isolates, thereby indicating a common origin. The very 
important sequence differences observed between the 
proteins are consistent with a divergent evolutionary 
process. In addition, the two African isolates are 
mutually more divergent than the American isolates 
already analyzed ; as far as that observation can be 
extrapolated, it suggests a longer evolution of the 
virus in Africa and is also consistent with the fact 
that a larger fraction of the population is exposed than 
in developed countries. 

A novel human retrovirus with morphology and 
biologocal properties (cytopathogenicity , T4 tropism) 
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similar to those of LAV, but nevertheless clearly 
genetically and antigenically distinct from it, was 
recently isolated from two patients with AIDS 
originating from Guinea Bissau, West-Africa [Clavel et 
al., 1986]. In neighboring Senegal, the population was 
seemingly exposed to a retrovirus also distinct from LAV 
but apparently non-pathogenic [Barin et al . , 1985 ; 
Kanki et al., 1986]. Both of these novel African retro- 
viruses seem to be antigenically related to the simian 
T-cell lymphotropic virus ( hereinafter " STLV— I II 11 ) shown 
to be widely present in healthy African green monkeys 
and other simian species [Kanki et al . 1985]. This 
raises the possibility of a large group of African 
primate lentiviruses , ranging from the apparently 
non-pathogenic simian viruses to the LAV-type viruses. 
Their precise relationship will only be known after 
their complete genetic characterization, but it is 
already very likely that they have evolved from a common 
progenitor. The important genetic variability we have 
observed between isolates of the AIDS virus in Central 
Africa is probably a hallmark of this entire group and 
may account for the apparently important genetic 
divergence between its members (loss of 

cross-antigenicity in the envelopes). In this sense, the 
conservation of the tropism for the T4 lymphocytes 
suggests that it is a major advantage aquired by these 
retroviruses. 

EXPERIMENTAL ppQCFDUfiES 
Virus Isolation 

LAV^j^ was isolated from the peripheral blood 
lymphocytes of the patient as described [Barre-Sinoussi 
et al., 1983]. Briefly, the lymphocytes were fractiona- 
ted and co-cultivated with phytohaemagglutinin-stimula- 
ted normal human lymphocytes in the presence of 
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inter leukin 2 and anti-alpha interferon serum. Viral 
production was assessed by cell-free reverse 
transcriptase (hereinafter "RT" ) activity assay in the 
cultures and by electron microscopy. 1 
IPtolecmlajr clomiLmwy 

Normal donor lymphocytes were acutely infected 
(TO 4 cpm of RT activity/10 6 cells) as described [Barre- 
Sinoussi et al., 1983] , and total DNA was extracted at 
the beginning of the RT activity peak. A lambda library 
using the L47-1 vector [Loenen and Brammar, 1982] was 
constructed by partial Hindlll digestion of the DNA as 
already described [Alizon et al., 1984]. DNA from 
infected cells was digested to completion with Hindlll, 
and the 9-10kb fraction was selected on 0.8 % low 
melting point agarose gel and ligated into L47-1 Hindlll 
arms. About 2.10 5 plaques for LAV M ._, obtained by in 
vitro packaging (Amersham) , were plated on JL. coli LA101 
and screened in situ under stringent conditions , using 
the 9 kb Sad insert of the clone lambda J19 [Alizon et 
al., 1984] carrying most of the LAV DDTT genome as probe. 
Clones displaying positive signals were plaque-purified 
and propagated on E. coli C600 recBC, and the recombi- 
nant phage M-H11 carrying the complete genetic 
information of lav m^l was ^ urther characterized by 
restriction mapping. 

Btocleotide seupnemnce strategy 

Viral fragments derived from M-H11 were 
sequenced by the dideoxy chain terminator procedure 
[Sanger et al . , 1977] after "shotgun" cloning in the 
M13mp8 vector [Messing and Viera, 1982] as previously 
described [Sonigo et al . , 1985]. The viral genome of 
LAV MAL is 9229 nucleotides long as shown in figs. 7A-7I. 
Each nucleotide of LAV^^ was determined from more than 
5 independent clones on average. 

SIGNIFICANCE OF THE FIOTMES 
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Figmire 11 contains an analysis of AIDS virus isolates, 
showing: 

A/ Restriction maps of the inserts of phage 
lambda clones derived from cells infected with LAV^j 
(hereinafter "E-H12" ) and with l>M mL (M-H11). The 
schematic genetic organization of the AIDS virus has 
been drawn above the maps. The LTRs are indicated by 
solid boxes. Restriction sites are indicated as follows: 
A:Aval; B : BamHI ; Bg:BgIII; E:EcoRI; H:HindIII; 

Hc:HincII; K:KpnI; N:NdeI; P:PstI; S:SacI; and X:XbaI. 
Asterisks indicate the Hindlll cloning sites in lambda 
L47- 1 vector . 

B/ A comparison of the sites for seven 
restriction enzymes in six isolates : the prototype AIDS 

virus LAV BRU , LAV MAL and lav eli ; and Z1# 22 and 7,3 ' 
Restriction sites are represented by. the following 
symbols vertically aligned wih the symbols in fig. 1A: 
Bglll; * :EcoRI; V . Hindi ; r : Hindll I ; ♦ : Kpnl ; O : Ndel ; 
and o : Sad . 

Figruure 2 shows the genetic organization of the central 
region in AIDS virus isolates. Stop codons in each phase 
are represented as vertical bars. Vertical arrows indi- 
cate possible AUG initiation codons. Splice acceptor (A) 
and donor (D) sites identified in subgenomic viral mRNA 
[Muesing et al . , 1985] are shown below the graphic of 
LAV BRU' and corres P° ndin 9 sites in LAV EL1 and I*AV MAL are 
indicated. PPT indicates the repeat of the polypurine 
tract flanking the 3 ' LTR . As observed in LAV 0 _ TT 
[Wain-Hobson et al . , 1985], the PPT is repeated 256 
nucleotides 5 ' to the end of the pol gene in both the 
LAV EL1 and LAV MAL sequences , but this repeat is 

degenerated at two positions in LAV_ _.. 

hJ-t I 

Figure 3 shows an alignment of the protein sequences of 
four AIDS virus isolates. Isolate LAV [Wain-Hobson et 
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al., 1985] is taken as reference ; only differences with 
LAV BRU are noted for ARV 2 [Sanchez-Pescador et al . , 
1985] and the two Zairian isolates LAV and LAV W . A 

- MA L EJj I 

minimal number of gaps (-) were introduced in the 
alignments. The NH 2 -termini of p25 gag and p18 gag are 
indicated [Sanchez-Pescador, 1985], The potential 
cleavage sites in the envelope precursor [Allan et al., 
1985a ; diMarzoVeronese, 1985] separating the signal 
peptide (hereinafter "SP"), OMP and TMP are indicated as 
vertical arrows ; conserved cysteines are indicated by 
black circles and variable cysteines are boxed. The one 
letter code for each amino acid is as follows: A: Ala ; 
C:Cys ; D:Asp ; E:Glu ; F : Phe ; G : Gly ; H :His ; I : He ; 
K : Lys ; L : Leu ; M:Met ; N : Asn ; P : Pro ; Q:Gln ; R : Arg ; 
S:Ser ; T:Thr ; V:Val; W:Trp ; Y : Tyr . 

Fi<gure 4 shows a quantitation of the sequence divergence 
between homologous proteins of different isolates. Part 
A of each table gives results deduced from two-by-two 
alignments using the proteins of LAV OOM as reference, 

o KU 

part B, those of LAV^j as reference. Sources: Muesing 
et al., 1985 for HTLV-3 ; Sanchez-Pescador et al., 1985 
for ARV 2 and Wain-Hobson et al . , 1985 for LAV n . For 

B RU 

each case in the tables, the size in amino acids of the 
protein (calculated from the first methionine residue or 
from the beginning of the orf for pol) is given at the 
upper left part. Below are given the number of deletions 
(left) and insertions (right) necessary for the align- 
ment. The large numbers in bold face represent the 
percentage of amino acids substitutions (insertions/de- 
letions being excluded). Two by two alignments were done 
with computer assistance [Wilburg and Lipman, 1983], 
using a gag penalty of 1, K-tuple of 1, and window of 
20, except for the hypervariable domains of env, where 
the number of gaps was made minimum, and which are 
essentially aligned as shown in fig. 3. The sequence of 



the predicted protein encoded by orf R of HTLV-3 has not 
been compared because of a premature termination relati- 
ve to all other isolates. 

Fitgruure 5 shows the variability of the AIDS virus 
envelope protein. For each position x of the alignment 
of env (Fig. 3), variability V(x) was calculated as: 
V(x) = number of different amino-acids at position x/ 
frequency of the most abundant amino acid at position x. 
Gaps in the alignments are considered as another amino 
acid. For an alignment of 4 proteins, V(x) ranges from 1 
(identical AA in the 4 sequences) to 16 (4 different 
AA) . This type of representation has previously been 
used in a compilation of the AA sequence of immunoglo- 
bulins variable regions [Wu and Kabat, 1970], Vertical 
arrows indicate the cleavage sites ; asterisks represent 
potential N-glysosylation sites (N-X-S/T) conserved in 
all three four solates ; black triangles represent 
conserved cysteine residues. Black lozanges mark the 
three major hydrophobic domains: OMP, TMP and SP; and 
the hypervariable domains: Hyl , 2 and 3. 

Figure 6 shows the direct repeats in the proteins of 
different AIDS virus isolates. These examples are 
derived from the aligned sequences of gag (a, b) , F 
(c,d) and env (e, f, g, h) shown in figure 3. The two 
elements of the direct repeat are boxed, while degene- 
rated positions are underlined. 

Figures 7A-7I show the complete cDNA sequence of LAV hJfm T 
of this invention. 

The invention thus pertains more specifically 
to the proteins, glycoproteins and other polypeptides 
including the polypeptidic structures shown in the 
figures 1-7. The first and last amino acid residues of 
these proteins, glycoproteins and polypeptides carry 
numbers computed from a first amino acid of the 
open-reading frames concerned, although these numbers do 
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not correspond exactly to those of the LhV WiL proteins 
concerned, rather to the corresponding proteins of the 
LAV BRU or se< 5 uences shown in figs. 3A, 3B and 3C. Thus a 
number corresponding to a "first. amino acid residue" of 

5 

a lav MA l P rote i n corresponds to the number of the first' 
amino-acyl residue of the corresponding LAV protein 

BR U 

which, in any of figs. 3A f 3B or 3C, is in direct 
alignment with the corresponding first amino acid of the 
LAVjy^ protein. Thus the sequences concerned can be read 
from figs. 7A-7I to the extent where they do not appear 
with sufficient clarity from Figs. 3A-3F. 

The preferred protein sequences of this 
invention extend between the corresponding "first" and 
"last" amino acid residues. Also preferred are the 
protein(s)- or glycoprotein ( s ) -portions including part 
of the sequences which follow : 

OMP or gp110 proteins, including precursors : 
1 to 530 

OMP or gp110 without precursor : 
34-530 

Sequence carrying the TMP or gp41 protein : 
531-877, particularly 
680-700 

well conserved stretches of OMP : 

37-130, 
211-289 and 
488-530 

well conserved stretch found at the end of the OMP and 
the beginning of TMP : 

490-620. 

Proteins containing or consisting of the "well 
conserved stretches" are of particular interest for the 
production of immunogenic compositions and (preferably 
in relation to the stretches of the env protein) of 
vaccine compositions against the LAV-1 viruses. 
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The invention concerns more particularly all 
the DNA fragments which have been more specifically 
referred to in the drawings and which correspond to open 
reading frames. It will be understood that one skilled 
in the art will be able to obtain them all, for instance 
by cleaving an entire DNA corresponding to the complete 
genome of LAV^^ ( such as by cleavage by a partial or 
complete digestion thereof with a suitable restriction 
enzyme and by the subsequent recovery of the relevant 
fragments. The DNA disclosed above can be resorted to 
also as a source of suitable fragments. The techniques 
disclosed in PCT application for the isolation of the 
fragments which can then be included in suitable 
plasmids are applicable here too. Of course, other 
methods can be used, some of which have been exemplified 
in European Application No. 178,978, filed September 17, 
1985. Reference is for instance made to the following 
methods : 

a) DNA can be transfected into mammalian cells 
with appropriate selection markers by a variety of tech- 
niques, such as calcium phosphate precipitation, 
polyethylene glycol, protoplast-fusion, etc. 

b) DNA fragments corresponding to genes can be 
cloned into expression vectors for E. coli, yeast- or 
mammalian cells and the resultant proteins purified. 

c) The provival DNA can be "shot-gunned" 
(fragmented) into procaryotic expression vectors to 
generate fusion polypeptides. Recombinants, producing 
antigenically competent fusion proteins, can be identi- 
fied by simply screening the recombinants with anti- 
bodies against LAV antigens. 

The invention further refers to DNA recombi- 
nants, particularly modified vectors, including any of 
the preceding DNA sequences adapted to transform corres- 
ponding microorganisms or cells, particularly eucaryotic 
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cells such as yeasts, for instance Saccharomvces 
Cerevisiae, or higher eucaryotic cells, particularly 
cells of mammals, and to permit expression of said DNA 
sequences in the corresponding microorganisms or cells. 
General methods of that type have been recalled in the 
abovesaid PCT international patent aplication PCT/EP 
85/00548, filed October 18, 1985. 

More particularly the invention relates to 
such modified DNA recombinant vectors modified by the 
abovesaid DNA sequences and which are capable of trans- 
forming higher eucaryotic cells particularly mammalian 
cells. Preferably, any of the abovesaid sequences are 
placed under the direct control of a promoter contained 
in said vectors and recognized by the polymerases of 
said cells, such that the first nucleotide codons 
expressed correspond to the first triplets of the 
above-defined DNA sequences . Accordingly, this invention 
also relates to the corresponding DNA fragments which 
can be obtained from the genome of LAV^^ or its cDNA by 
any appropriate method. For instance, such a method 
comprises cleaving said LAV MAL genome or its cDNA by 
restriction enzymes preferably at the level of restric- 
tion sites surrounding said fragments and close to the 
opposite extremities respectively thereof, recovering 
and identifying the fragments sought according to sizes, 
if need be checking their restriction maps or nucleotide 
sequences (or by reaction with monoclonal antibodies 
specifically directed against epitopes carried by the 
polypeptides encoded by said DNA fragments), and further 
if need be, trimming the extremities of the fragment, 
for instance by an exonucleolytic enzyme such as Bal31, 
for the purpose of controlling the desired nucleotid- 
sequences of the extremities of said DNA fragments or, 
conversely, repairing said extremities with Klenow 
enzyme and possibly ligating the latter to synthetic 



polynucleotide fragments designed to permit the 
reconstitution of the nucleotide extremities of said 
fragments. Those fragments may then be inserted in any 
of said vectors for causing the expression of the 
corresponding polypeptide by the cell transformed there- 
with. The corresponding polypeptide can then be recove- 
red from the transformed cells, if need be after lysis 
thereof, and purified by methods such as electrophore- 
sis. Needless to say, all conventional methods for 
performing these operations can be resorted to. 

The invention also relates more specifically 
to cloned probes which can be made starting from any DNA 
fragment according to this invention, thus to recombi- 
nant DNAs containing such fragments, particularly any 
plasmids amplifiable in procaryotic or eucaryotic cells 
and carrying said fragments. Using the cloned DNA 
fragments as a molecular hybridization probe - either by 
labelling with radionucleotides or with fluorescent 
reagents - LAV virion RNA may be detected directly in 
the blood, body fluids and blood products (e.g. of the 
antihemophylic factors such as Factor VIII concentrates) 
and vaccines (e.g., hepatitis B vaccine). It has already 
been shown that whole virus can be detected in culture 
supernatants of LAV producing cells. A suitable method 
for achieving that detection comprises immobilizing 
virus on a support (e.g., a nitrocellulose filter ) , dis- 
rupting the virion and hybridizing with labelled 
(radiolabeled or "cold" fluorescent- or enzyme-label- 
led) probes. Such an approach has already been developed 
for Hepatitis B virus in peripheral blood [SCOTTO J. et 
al. Hepatology (1983), 3, 379-384]. 

Probes according to the invention can also be 
used for rapid screening of genomic DNA derived from the 
tissue of patients with LAV related symptoms, to see if 
the proviral DNA or RNA present in host tissue and other 



tissues can be related to that of LAV. 
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A method which can be used for such screening 
comprises the following steps : extraction of DNA from 
tissue, restriction enzyme cleavage of said DNA, 
electrophoresis of the fragments and Southern blotting 
of genomic DNA from tissues, subsequent hybridization 
with labelled cloned LAV proviral DNA . Hybridization in 
situ can also be used. 

Lymphatic fluids and tissues and other non- 
lymphatic tissues of humans, primates and other 
mammalian species can also be screened to see if other 
evolutionnary related retrovirus exist. The methods 
referred to hereinabove can be used, although hybridi- 
zation and washings would be done under non-stringent 
conditions . 

The DNAs or DNA fragments according to the 
invention can be used also for achieving the expression 
of viral antigens of LAV MAL for diagnostic purposes. 

The invention relates generally to the poly- 
peptides themselves, whether synthesized chemically, 
isolated from viral preparations or expressed by the 
different DNAs of the invention, particularly by the 
ORFs or fragments thereof in appropriate hosts, par- 
ticularly procaryotic or eucaryotic hosts, after trans- 
formation thereof with a suitable vector previously 
modified by the corresponding DNAs. 

More generally, the invention also relates to 
any of the polypeptide fragments (or molecules, particu- 
larly glycoproteins having the same polypeptidic back- 
bone as the polypeptides mentioned hereinabove) bearing 
an epitope characteristic of a protein or glycoprotein 
of LAV MAL , which polypeptide or molecule then has 
N-terminal and C-terminal extremities respectively 
either free or, independently from each other, cova- 
lently bonded to amino acids other than those which are 
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normally associated with them in the larger polypeptides 
or glycoproteins of the LAV virus, which last mentioned 
amino acids are then free or belong to another polypep- 
tide sequence. Particularly, the invention relates to 
hybrid polypeptides containing any of the epitope- 
bearing-polypeptides which have been defined more speci- 
fically hereinabove, recombined with other polypeptides 
fragments normally foreign to the LAV proteins, having 
sizes sufficient to provide for an increased immunogeni- 
city of the epitope-bearing-polypeptide yet, said 
foreign polypeptide fragments either being immunogeni- 
cally inert or not interfering with the immunogenic 
properties of the epitope-bearing-polypeptide . 

Such hybrid polypeptides, which may contain 
from 5 up to 150, even 250 amino acids, usually consist 
of the expression products of a vector which contained 
ab initio a nucleic acid sequence expressible under the 
control of a suitable promoter or replicon in a suitable 
host, which nucleic acid sequence had however beforehand 
been modified by insertion therein of a DNA sequence 
encoding said epitope-bearing-polypeptide. 

Said epitope-bearing-polypeptides, particular- 
ly those whose N-terminal and C-terminal amino acids are 
free, are also accessible by chemical synthesis accord- 
ing to technics well known in the chemistry of proteins. 

The synthesis of peptides in homogeneous 
solution and in solid phase is well known. In this 
respect, recourse may be had to the method of synthesis 
in homogeneous solution described by Houbenweyl in the 
work entitled "Methoden der Organischen Chemie M (Methods 
of Organic Chemistry) edited by E. WUNSCH. , vol. 15-1 
and II, THIEME, Stuttgart 1974. This method of synthesis 
consists of successively condensing either the 
successive amino acids in twos, in the appropriate order 
or successive peptide fragments previously available or 



formed and containing already several amino-acyl 
residues in the appropriate order respectively. Except 
for the carboxyl and aminogroups which will be engaged 
in the formation of the peptide bonds, care must be 
taken to protect beforehand all other reactive groups 
borne by these amino-acyl groups or fragments . However , 
prior to the formation of the peptide bonds, the 
carboxyl groups are advantageously activated, according 
to methods well known in the synthesis of peptides . 
Alternatively, recourse may be had to coupling reactions 
bringing into play conventional coupling reagents, for 
instance of the carbodiimide type, such as 
1 -ethyl -3- ( 3 -dimethyl -amino- propyl ) -carbodiimide . When 
the amino acid group used carries an additional amine 
group (e.g., lysine) or another acid function (e.g., 
glutamic acid), these groups may be protected by 
carbobenzoxy or t-butyloxycarbonyl groups, as regards 
the amine groups, or by t-butylester groups, as regards 
the carboxylic groups. Similar procedures are available 
for the projection of other reactive groups. for 
example, an -SH group (e.g., in cysteine) can be 
protected by an acetamidomethyl or paramethoxybenzyl 
group. 

In the case of a progressive synthesis, amino 
acid by amino acid, the synthesis starts preferably with 
the condensation of the C-terminal amino acid with the 
amino acid which corresponds to the neighboring amino- 
acyl group in the desired sequence and so on, step by 
step, up to the N-terminal amino acid. Another preferred 
technique which can be used is that described by R.D. 
Merrifield in "Solid Phase Peptide Synthesis" [J. Am. 
Chem. Soc, 45, 2149-2154]. In accordance with the 
Merrifield process, the first C-terminal amino acid of 
the chain is fixed to a suitable porous polymeric resin, 
by means of its carboxylic group, the amino group of the 
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amino acid then being protected, for example by a 
t-butyloxycarbonyl group. When the first C- terminal 
amino acid is thus fixed to the resin, the protective 
group of the amine group is removed by washing the resin 
with an acid, i.e., trif luoroacetic acid, when the 
protective group of the amine group is a t-butyloxycar- 
bonyl group. Then, the carboxylic group of the second 
amino acid, which is to provide the second amino-acyl 
group of the desired peptidic sequence, is coupled to 
the deprotected amine group of the C-terminal amino acid 
fixed to the resin. Preferably, the carboxyl group of 
this second amino acid has been activated, for example 
by dicyclohexyl-carbodiimide, while its amine group has 
been protected, for example by a t-butyloxycarbonyl 
group. The first part of the desired peptide chain, 
which comprises the first two amino acids, is thus 
obtained. As previously, the amine group is then de- 
protected, and one can further proceed with the fixing 
of the next amino-acyl group and so forth until the 
whole peptide sought is obtained. The protective groups 
of the different side groups, if any, of the peptide 
chain so formed can then be removed. The peptide sought 
can then be detached from the resin, for example by 
means of hydrofluoric acid, and finally recovered in 
pure form from the acid solution according to 
conventional procedures . 

As regards the peptide sequences of smallest 
size bearing an epitope or immunogenic determinant, and 
more particularly those which are readily accessible by 
chemical synthesis, it may be required, in order to 
increase their in vivo immunogenic character, to couple 
or "conjugate" them covalently to a physiologically 
acceptable and non-toxic carrier molecule. By way of 
examples of carrier molecules or macromolecular supports 
which can be used for making the conjugates according to 
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the invention can be mentioned natural proteins, such as 
tetanic toxoid, ovalbumin, serum-albumins, hemocyanins, 
etc. Synthetic macromolecular carriers, for example 
polysines or poly(D-L-alanine)-poly(L-lysine)s, can be 
used too. Other types of macromolecular carriers that 
can be used, which generally have molecular weights 
higher than 20,000, are known from the literature. The 
conjugates can be synthesized by known processes such as 
are described by Frantz and Robertson in "Infect, and 
Immunity", 33, 193-198 (1981) and byP.E. Kauffmanin 
"Applied and Environmental Microbiology", October 1981 
Vol. 42, No. 4, pp. 611-614. For instance, the following 
coupling agents can be used : glutaric aldehyde, ethyl 
chlorof ormate, water-soluble carbodiimides such 
as ( N- ethyl -N ' ( 3-dimethylamino-propyl ) carbodiimide , 

HC1), diisocyanates, bis-diazobenzidine, di- and 
trichloro-s-triazines , cyanogen bromides and 

benzaquinone, as well as the coupling agents mentioned 
in "Scand. J. Immunol.", 1978, vol. 8, pp. 7-23 
(Avrameas, Ternynck, Guesdon) . 

Any coupling process can be used for bonding 
one or several reactive groups of the peptide, on the 
one hand, and one or several reactive groups of the 
carrier, on the other hand. Again coupling is advanta- 
geously achieved between carboxyl and amine groups 
carried by the peptide and the carrier or vice- versa in 
the presence of a coupling agent of the type used in 
protein synthesis, e.g., 1 -ethyl-3- ( 3-dimethylaminopro- 
pyl ) -carbodiimide , N-hydroxybenzotriazole , etc . Coupling 
between amine groups respectively borne by the peptide 
and the carrier can also be made with glutaraldehyde , 
for instance, according to the method described by 
BOQUET, P. et al. (1982) Molec. Immunol., 12, 1441-1549, 
when the carrier is hemocyanin. 

The immunogenicity of epitope-bearing-peptides 



can also be reinforced by oligomerisation thereof, for 
example in the presence of glutaraldehyde or any other 
suitable coupling agent. In particular, the invention 
relates to the water soluble immunogenic oligomers thus 
obtained, comprising particularly from 2 to 10 monomer 
units . 

The glycoproteins, proteins and other polypep- 
tides (generally designated hereinafter as "antigens" of 
this invention) whether obtained by methods, such as are 
disclosed in the earlier patent applications referred to 
above, in a purified state from LAV^^ virus prepara- 
tions or - as concerns more particularly the peptides - 
by chemical synthesis, are useful in processes for the 
detection of the presence of anti-LAV antibodies in 
biological media, particularly biological fluids such as 
sera from man or animal, particularly with a view of 
possibly diagnosing LAS or AIDS. 

Particularly the invention relates to an in 
vitro process of diagnosis making use of an envelope 
glycoprotein or of a polypeptide bearing an epitope of 
this glycoprotein of LAV MAL for the detection of anti- 
LAV antibodies in the serums of persons who carry them. 
Other polypeptides - particular those carrying an epito- 
pe of a core protein - can be used too. 

A preferred embodiment of the process of the 
invention comprises : 

- depositing a predetermined amount of one or several of 
said antigens in the cups of a titration microplate ; 

introducing increasing dilutions of the biological 
fluid, to be diagnosed (e.g., blood serum, spinal fluid, 
lymphatic fluid, and cephalo-rachidian fluid), into 
these cups ; 

- incubating the microplate ; 

- washing carefully the microplate with an appropriate 
buffer ; 



- adding into the cups specific labelled antibodies di- 
rected against blood immunoglobulins and 

- detecting the antigen-antibody-complex formed, which 
is then indicative of the presence of LAV antibodies in 
the biological fluid . 

Advantageously the labelling of the anti-immu- 
noglobulin antibodies is achieved by an enzyme selected 
from among those which are capable of hydrolysing a 
substrate, which substrate undergoes a modification of 
its radiation-absorption, at least within a 
predetermined band of wavelenghts. The detection of the 
substrate, preferably comparatively with respect to a 
control, then provides a measurement of the potential 
risks, or of the effective presence, of the disease. 

Thus, preferred methods of immuno-enzymatic 
and also immunof luorescent detections, in particular 
according to the ELISA technique, are provided. 
Titrations may be determinations by immunofluorescence 
or direct or indirect immuno-enzymatic determinations. 
Quantitative titrations of antibodies on the serums 
studied can be made. 

The invention also relates to the diagnostic 
kits themselves for the in vitro detection of antibodies 
against the LAV virus, which kits comprise any of the 
polypeptides identified herein and all the biological 
and chemical reagents, as well as equipment , necessary 
for peforming diagnostic assays. Preferred kits comprise 
all reagents required for carrying out ELISA assays. 
Thus preferred kits will include, in addition to any of 
said polypeptides, suitable buffers and anti-human immu- 
noglobulins, which anti-human immunoglobulins are label- 
led either by an immunof luorescent molecule or by an 
enzyme. In the last instance, preferred kits also com- 
prise a substrate hydrolysable by the enzyme and provid- 
ing a signal, particularly modified absorption of a 



radiation, at least in a determined wavelength, which 

signal is then indicative of the presence of antibody in 

the biological fluid to be assayed with said kit. 

It can of course be of advantage to use 

several proteins or polypeptides not only of LAV.i_" . but 

MAL 

also of LAV ELI together with homologous proteins or po- 
lypeptides of earlier described viruses, such as LAV 
HTLV-3, ARV 2, etc. 

The invention also relates to vaccine composi- 
tions whose active principle is to be constituted by any 
of the antigens, i.e., the hereinabove disclosed poly- 
peptides of LAV mL ' particularly the purified gp110 or 
immunogenic fragments thereof, fusion polypeptides or 
oligopeptides in association with a suitable pharmaceu- 
tical^ or physiologically acceptable carrier. A first 
type of preferred active principle is the gp110 
immunogen of said immunogens . Other preferred active 
principles to be considered in that fields consist of 
the peptides containing less than 250 amino acid units, 
preferably less than 150, particularly from 5 to 150 
amino acid residues, as deducible for the complete 
genome of LAV MAL and even more preferably those peptides 
which contain one or more groups selected from Asn-X-Thr 
and Asn-X-Ser as defined above. Preferred peptides for 
use in the production of vaccinating principles are 
peptides (a) to (f) as defined above. By way of example, 
there may be mentioned that suitable dosages of the 
vaccine compositions are those which are effective to 
elicit antibodies in vivo , in the host, particularly a 
human host. Suitable doses range from 10 to 500 
micrograms of polypeptide, protein or glycoprotein per 
kg, for instance 50 to 100 micrograms per kg. 

The different peptides according to this in- 
vention can also be used themselves for the production 
of antibodies, preferably monoclonal antibodies specific 
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for the respective different peptides. For the produc- 
tion of hybridomas secreting said monoclonal antibodies, 
conventional production and screening methods can be 
used. These monoclonal antibodies, which themselves are 
part of the invention, provide very useful tools for the 
identification and even determination of relative pro- 
portions of the different polypeptides or proteins in 
biological samples, particularly human samples contain- 
ing LAV or related viruses. 

The invention further relates to the hosts 
(procaryotic or eucaryotic cells) which are transformed 
by the above mentioned recombinants and which are capa- 
ble of expressing said DNA fragments. 

Finally the invention also concerns vectors 
for transforming eucaryotic cells of human origin, par- 
ticularly lymphocytes, the polymerase of which are 
capable of recognizing the LTRs of LAV . Particularly 
said vectors are characterized by the presence of a LAV 
LTR therein, said LTR being then active as a promoter 
enabling the efficient transcription and translation in 
a suitable host of a DNA insert coding for a determined 
protein placed under its controls. 

Needless to say, the invention extends to all 
variants of genomes and corresponding DNA fragments 
(ORFs) having substantially equivalent properties, all 
of said genomes belonging to retroviruses which can be 
considered as equivalents of LAV^^ It must be under- 
stood that the claims which follow are also intended to 
cover all equivalents of the products (glycoproteins, 
polypeptides, DNAs, etc.) whereby an equivalent is a 
product, e.g., a polypeptide, which may distinguish from 
a product defined in any of said claims, say through one 
or several amino acids, while still having substantially 
the same immunological or immunogenic properties. A 
similar rule of equivalency shall apply to the DNAs , it 



being understood that the rule of equivalency will then 
be tied to the rule of equivalency pertaining to the 
polypeptides which they encode. 

It will also be understood that all the 
literature referred to hereinbefore and hereinafter and 
all patent applications and patents not specifically 
identified herein but which form counterparts of those 
specifically designated herein, must be considered as 
incorporated herein by reference. 

It should further be mentioned that the 
invention further relates to immunogenic compositions 
that contain preferably one or more of the polypeptides, 
which are specifically identified above and which have 
the amino acid sequences of LAV mal that have been 
identified, or peptidic sequences corresponding to 
previously defined LAV proteins. In this respect, the 
invention relates more particularly to the particular 
polypeptides which have the sequences corresponding more 
specifically to the LAV^^ sequences which have been 
referred to earlier, i.e., the sequences extending 
between the following first and last amino acids, of the 
LAV ODT1 proteins themselves, i.e., the polypeptides 
having sequences contained in the LAV^y OMP or LAV fiRU 
TMP or sequences extending over both, particularly those 
extending from between the following positions of the 
amino acids included in the env open reading frame of 
the LAV^y genome, 

1-530 
34-530 
and more preferably 

531-877, particularly 680-700, 
37-130 

211-289 

488-530 

490-620. 
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These different sequences can be used for any 
of the above defined purposes and in any of the compo- 
sitions which have been disclosed. 
g Finally the invention also relates to the 

different antibodies which can be formed specifically 
against the different peptides which have been disclosed 
herein, particularly to the monoclonal antibodies which 
recognize them specifically. The corresponding hybri- 
1Q domas which can be formed starting from spleen cells 
previously immunized with such peptides which are fused 
with appropriate myeloma cells and selected according to 
standard procedures also form part of the invention. 

Phage A clone E-H12 derived from LAV ELI 
15 infected cells has been deposited at the CNCM under No. 
1-550 on May 9, 1986. Phage clone M-H11 derived from 
LAV MAL infe cted cells has been deposited at the CNCM 
under No. 1-551 on May 9, 1986. 
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Claims : 

1. The virus LAV^^ comprising RNA correspond- 
ing to the cDNA of figs. 7A-7I. 

2. The cDNA of figs. 7A-7I. 

3. A DNA recombinant comprising the cDNA of 

claim 2. 

4. A probe containing a nucleic acid sequence 
hybridizable with RNA of said ^^ mL virus of claim 1. 

5. A method for identifying the presence in a 
host tissue of LAV virus which comprises hybridizing RNA 
obtained from said tissue with said probe of claim 4. 

6. The method of claim 5, wherein said probe 

can hybridize with RNA from said LAV^ T virus to iden- 

MA L 

tify said LAV^ L virus. 

7. A peptide or fragment thereof whose amino 
acid sequence is encoded by an open reading frame of a 
cDNA sequence of the LAV MAL virus of claim 1 . 

8. The peptide of claim 7 encoded by a cDNA 
sequence from amino-acyl residue 37 to amino-acyl 
residue 130, or from amino-acyl residue 211 to 
amino-acyl residue 289, or from amino-acyl residue 488 
to amino-acyl residue 530 of figs. 3A-3F and 7A-7I. 

9. The peptide of claim 7 encoded by a cDNA 
sequence from amino-acyl residue 490 to amino-acyl 
residue 620 or from amino-acyl residue 680 to amino-acyl 
residue 700 of figs. 3A-3F and 7A-7I. 

10. The peptide of claim 7 which comprises a 
protein or glycoprotein whose amino acid sequence is 
encoded by all or part of one of the following cDNA 
sequences of figs. 3A-3F and 7A-7I: 

0MP or gp110 proteins, including precursors: 

1 to 530; 

0MP or gp110 without precursor: 34-530; and 
TMP or gp41 protein: 531-877. 

11. The peptide of claim 10 encoded by all 
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or part of one of the following cDNA sequences of figs. 
3A-3F and 7A-7I: 37-130, 211-289, 488-530, 490-620 or 
680-700. 

12. A method for the in vitro detection of the 
presence of an antibody directed against a LAV virus in 
a human body fluid, which comprises: contacting said 
body fluid with an antigen obtained from said virus 
LAV MAL of claim 1 ' said antigen consisting of a peptide 
or a fragment thereof whose amino acid sequence is 
encoded by an open reading frame of a cDNA sequence of 
figs. 7A-7I; and then detecting the immunological 
reaction between said antigen and said antibody. 

13. The method of claim 12 wherein said 

antigen detects said LAV virus of claim 1. 

M AL 

14. The method of claim 12 which comprises the 

steps of: 

a) depositing a predetermined amount of said antigen 
into a cup of a titration microplate; 

b) introducing increasing dilutions of said body fluid 
into said cup; 

c) incubating said microplate; 

d) washing the microplate with a buffer; 

e) adding into said cup a labelled antibody directed 
against blood immunoglobulins; and then 

f) determining whether an antigen-antibody-complex has 
formed in said cup which is indicative of the presence 
of a LAV antibody in said body fluid. 

15. A diagnostic kit for the in vitro de- 
tection of antibodies against a LAV virus, which kit 
comprises: an antigen consisting of a peptide of claim 
7. 

16. The kit of claim 15 wherein the antigen 

consists of a peptide of said LAV mT virus of claim 1, 

MAL 

encoded by the open reading frame of a cDNA sequence of 
said LAV virus. 



48 

17. An immunogenic composition comprising: an 
antigen of the LAV MAL virus of claim 1 or an immunogenic 
peptide or fragment thereof encoded by RNA of said 
virus; and a physiologically acceptable carrier. 

18. The immunogenic composition of claim 17 
wherein said peptide is the gp110 envelope glycoprotein 
or a fragment thereof. 

19. The immunogenic composition of claim 17 
wherein the peptide comprises a protein or glycoprotein 
hwose amino acid sequence is encoded by all or part of 
one of the following cDNA sequences of figs 3a-3F and 
7A-7I: 

OMP or gp110 proteins, including precursors: 

1 to 530; 

OMP or gp110 without precursor: 34-530; and 
TMP or g P 41 : 531-877. 

20. The composition of claim 19 wherein the 
protein or glycoprotein is encoded by all or part of one 
of the following cDNA sequences of Figs. 3A-3F and 
7A-7I: 37-130, 211-289, 488-530, 490-620 or 680-700. 

21. An antibody formed against a peptide of 

claim 7 . 

22. A cell transformed with a DNA recombinant 
of claim 3. 



4 



ABSTRACT 

A variant of 
and capable of causing 
the LAV MAL virus can be 
and pre-AIDS. 
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a LAV virus, designated LAV„„ 

M AL 

AIDS. The cDNA and antigens of 
used for the diagnosis of AIDS 
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GGTCTCTCTTGTTAGACCAGGTCGAGCCCGGGAGCTCTCTGGCTAGCAAGGAACCCACTG 
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cttaagcctcaataaagcttgccttgagtgcctcaa'gcagtgtgtgcccatctgttgtgt 

100 U5— ]. 

gactctggtaactagagatccctcagaccactctagacggtgtaaaaatctctagcagtig 
gcgcccgmc'agggactttamgtgamgtaacagggactcgaaagcggaagttccagag 

200 ... . 

aagttctctcgacgcaggactcggcttgctgaggtgcacacagcaagaggcgagagcggc 

r GAG 300 
HetGlyAlaArg 

GACTGGTGAGTACGCCAATTTTTGACTAGCGGAGGCTAGAAGGAGAGAGATGGGTGCGAG 

ii iii i 

AlaSerValLeuSerGlyGlyLysLeuAspAlaTrpGluLysIleArgLeuArgProGly 
AGCGTCAGTATTAAGCGGGGGAAAATTAGATGCATGGGAGAAAATTCGGTTAAGGCCAGG 

400 

GlyLysLysLysTyrArgLeuLysHisLeuValTrpAlaSerArgGluLeuGluArgPhe 
GGGAAAGAAAAAATATAGACTGAAACATTTAGTATGGGCAAGCAGGGAGCTGGAAAGATT 

AlaLeuAsnProGlyLeuLeuGluThrGiyGluGlyCysGlnGlnlleMetGluGlnLeu 
CGCACTTAACCCTGGCCTTTTAGAAACAGGAGAAGGATGTCAACAAATAATGGAACAGCT 
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ACAATCAACTCTCAAGACAGGATCAGAAGAAATTAAATCATTATATAATACAGTAGCAAC 

600 

LeuTyrCysValHi sGlnArgl 1 eAspValLysAspThrLysGluAlaLeuAspLy s He 
CCTCTATTGTGTACATCAAAGGATAGATGTAAAAGACACCAAGGAAGCGCTAGATAAAAT 

GluGluIleGlnAsnLysSerArgGlnLysThrGlnGlnAlaAlaAlaAlaGlnGlnAia 
AGAGGAAATACAAAATAAGAGCAGGCAAAAGACACAGCAGGCAGCAGCTGCACAGCAGGC 

700 

AlaAlaAlaThrLysAsnSerSerSerValSerGlnAsnTyrProIleValGlnAsnAla 
AGCAGCTGCCACAAAAAACAGCAGCAGTGTCAGTCAAAATTACCCCATAGTGCAAAATGC 

GlnGlyGlnMetlleHisGlnAlalleSerProArgThrLeuAsnAlaTrpValLysVal 
ACAAGGGCAAATGATACATCAGGCCATATCACCTAGGACTTTGAATGCATGGGTGAAAGT 

800 

IleGluGluLysAlaPheSerProGluVallleProMetPheSerAlaLeuSerGluGly 
AATAGAAGAAAAGGCTTTCAGCCCAGAAGTGATACCCATGTTCTCAGCATTATCAGAGGG 

900 

AlaThrProGlnAspLeuAsnMetMetLeuAsnlleValGlyGlyHisGlnAlaAlaMet 
GGCCACCCCACAAGATTTAAATATGATGCTGAACATAGTTGGAGGACACCAGGCAGCTAT 

GlnMetLeuLysAspThrlleAsnGluGiuAlaAlaAspTrpAspArgValHisProVal 
GCAAATGTTAAAAGATACCATCAATGAGGAAGCTGCAGACTGGGACAGGGTACATCCAGT 

1000 

HisAlaGlyProIleProProGlyGlnMetArgGluProArgGlySerAspIleAlaGly 
ACATGCAGGGCCTATTCCCCCAGGCCAGATGAGAGAACCAAGAGGAAGTGACATAGCAGG 

• i.i i ■ i 
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ThrThrSerThrLeuGlnGluGlnlleGlyTrpMetThrSerAsnProProIleProVal 
AACTACTAGTACCCTTCAAGAACAAATAGGATGGATGACAAGCAACCCACCTATCCCAGT 

1100 

GlyAspIleTyrLysArgTrpIlelleLeuGlyLeuAsnLysIleValArgMetTyrSer 
GGGAGACATCTATAAAAGATGGATAATCCTGGGATTAAATAAAATAGTAAGAATGTATAG 

1200 

ProValSerlleLeuAspIleArgGlnGlyProLysGluProPheArgAspTyrValAsp 
CCCTGTCAGCATTTTGGACATAAGACAAGGGCCAAAGGAACCTTTTAGAGACTATGTAGA 

i i i i i i 

ArgPhePheLysThrLeuArgAlaGluGlnAlaThrGlnGluValLysAsnTrpMetThr 
TAGGTTCTTTAAAACTCTCAGAGCTGAGCAAGCTACACAGGAGGTAAAAAATTGGATGAC 

1300 

GluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrlleLeuLysAlaLeuGly 
AGAAACCTTGCTGGTCCAAAATGCGAATCGAGACTGTAAGACCATTTTAAAAGCATTAGG 

ProGlyAlaThrLeuGluGluMetMetThrAlaCysGlnGlyValGlyGlyProSerHis 
ACCAGGGGCTACATTAGAAGAAATGATGACAGCATGCCAGGGAGTGGGAGGACCCAGTCA 

1400 

LysAlaArgValLeuAlaGluAlaMetSerGlnAlaThrAsnSerThrAlaAlalleMet 
TAAAGCAAGAGTTTTGGCTGAGGCAATGAGCCAAGCAACAAATTCAACTGCTGCCATAAT 

1500 

MetGlnArgGlyAsnPheLysGlyGlnLysArglleLysCysPheAsnCysGlyLysGlu 
GATGCAGAGAGGTAATTTTAAGGGCCAGAAAAGAATTAAGTGTTTCAACTGTGGCAAAGA 



1600 rPOL , 

PhePheArgGluAsnLeu 

GluGlyHisGlnMetLysAspCysThrGluArgGlnAlaAsnPheLeuGlyLysIleTrp 
GGAAGGACACCAAATGAAAGACTGCACTGAGAGACAGGCTAAffTTTTTAGGGAAAATTTG 




GCCTTCCCACAAGGGAAGGCCAGGGAATTTCCTTCAGAGCAGACCAGAGCCAACAGCCCC 

1700 , 

ThrSerArgGluLeuArgValTrpGlyGlyAspLysThrLeuSerGluThrGlyAlaGlu 
ProAlaGluSerPheGlyPheGlyGluGluIleLysProSerGlnLysGlnGluGlnLys 
ACCAGCAGAGAGCTTCGGGTTTGGGGAGGAGATAAAACCCTCTCAGAAACAGGAGCAGAA 

1800 

ArgGlnGlylleValSerPheSerPheProGlnlleThrLeuTrpGlnArgProValVal 
AspLysGluLeuTyrProLeuAlaSerLeuLysSerLeuPheGlyAsnAspGlnLeuSer 
AGACAAGGAATTGTATCCTTTAGCTTCCCTCAAATCACTCTTTGGCAACGACCAGTTGTC 
GAG *i • i 

ThrValArgValGlyGlyGlnLeuLysGluAlaLeuLeuAspThrGlyAlaAspAspThr 
Gin 

ACAGfTAAGAGTAGGAGGACAGCTAAAAGAAGCTCTATTAGACACAGGAGCAGATGATACA 

1900 

ValLeuGluGluIleAsnLeuProGlyLysTrpLysProLysMetlleGlyGlylleGly 
GTATTAGAAGAAATAAATTTGCCAGGAAAATGGAAACCAAAAATGATAGGGGGAATTGGA 

■ i i i i i 

GlyPhelleLysValArgGlnTyrAspGlnlleLeuIleGluIleCysGlyLysLysAla 
GGTTTTATCAAAGTAAGACAGTATGATCAAATACTTATAGAAATTTGTCGAAAAAAGGCT 

2000 FIG. 7B 



IleGlyThrlleLeuValGlyProThrProValAsnllelleGlyArgAsnMetLeuThr 
ATAGGTACAATATTGGTAGGACCTACACCTGTCAACATAATTGGACGAAATATGTTGACT 

2100 

GlnlleGlyCysThrLeuAsnPheProIleSerProIleGluThrValProValLysLeu 
CAGATTGGTTGTACTTTAAATTTTCCAATTAGTCCTATTGAGACTGTACCAGTAAAATTA 

i i i i i i 

LysProGlyMetAspGlyProArgValLysGlnTrpProLeuThrGluGluLysIleLys 
AAGCCAGGGATGGATGGCCCAAGGGTTAAACAATGGCCATTGACAGAAGAAAAAATAAAA 

2200 

AlaLeuThrGluIleCysLysAspMetGluLysGluGlyLysIleLeuLysIleGlyPro 
GCATTAACAGAAATTTGTAAAGATATGGAAAAGGAAGGAAAAATTTTAAAAATTGGGCCT 

i iii ii 

GluAsnProTyrAsnThrProValPheAlalleLysLysLysAspSerThrLysTrpArg 
GAAAATCCATACAATACTCCAGTATTTGCCATAAAGAAAAAAGACAGCACTAAATGGAGA 

2300 , 

LysLeuValAsnPheArgGluLeuAsnLysArgThrGlnAspPheTrpGluValGlnLeu 
AAATTAGTGAATTTCAGAGAGCTTAATAAAAGAACTCAAGATTTTTGGGAAGTTCAATTA 

2400 

GlylleProHisProAlaGlyLeuLysLysLysLysSerValThrValLeuAspValGly 
GGAATACCACATCCTGCTGGGTTGAAAAAGAAAAAATCAGTCACAGTATTGGATGTGGGG 

i i i i i i 

AspAlaTyrPheSerValProLeuAspGluAspPheArgLysTyrThrAlaPheThrlle 
GATGCATATTTTTCAGTCCCTTTAGATGAAGATTTCAGGAAGTATACTGCATTCACTATA 

2500 

ProSerlleAsnAsnGluThrProGlylleArgTyrGlnTyrAsnValLeuProGlnGly 
CCCAGTATTAATAATGAGACACCAGGGATTAGATATCAGTACAATGTGCTACCACAGGGA 

TrpLysGlySerProAlaliePheGlnSerSerMetThrLysIleLeuGiuProPheArg 
TGGAAAGGATCACCAGCAATATTCCAGAGTAGCATGACAAAAATCTTAGAACCCTTT AGA 

2600 

ThrLysAsnProGluIleVallleTyrGlnTyrMetAspAspLeuTyrValGlySerAsp 
ACAAAAAATCCAGAAATAGTCATATACCAATACATGGATGATTTGTATGTAGGGTCTGAT 

2700 

LeuGluIleGlyGlnHisArgThrLysIleGluGluLeuArgGluHisLeuLeuLysTrp 
TTAGAAATAGGACAACATAGAACAAAAATAGAGGAACTAAGAGAACATCTATTGAAATGG 

GlyPheThrThrProAspLysLysHisGlnLysGluProProPheLeuTrpMetGlyTyr 
GGATTTACCACACCAGACAAAAAGCATCAGAAAGAACCCCCATTTCTTTGGATGGGGTAT 

2800 

GluLeuHisProAspLysTrpThrValGlnProIleGlnLeuProAspLysGluSerTrp 
GAACTCCACCCTGACAAATGGACAGTGCAGCCTATACAACTGCCAGACAAGGAAAGCTGG 

ThrValAsnAspIleGlnLysLeuValGlyLysLeuAsnfrpAlaSerGinlleTyrPro 
ACTGTCAATGATATACAGAAATTGGTGGGAAAACTAAATTGGGCAAGTCAGATTTATCCA 

2900 

GlylleLysValLysGlnLeuCysLysLeuLeuArgGlyAlaLysAlaLeuThrAspIle 
GGAATTAAAGTAAAGCAATTATGTAAACTCCTTAGGGGAGCAAAAGCACTAACAGACATA 

3000 

ValProLeuThrAlaGluAlaGluLeuGluLeuAlaGluAsnArgGluIleLeuLysGlu 
GTACCATTAACTGCAGAGGCAGAATTAGAATTGGCAGAGAACAGGGAAATTCTAAAAGAA 



FIG. 7C 



ProValHisGlyValTyrTyrAspProSerLysAspLeuIleAlaGluIleGlnLysGln 
CCAGTGCATGGGGTATATTATGACCCATCAAAAGACTTAATAGCAGAAATACAGAAGCAG 

3100 



LysTyrAlaArglleLysSerAlaHisThrAsnAspValLysGlnLeuTh'rGluAlaVal 
AAGTATGCAAGAATAAAGTCTGCCCACACTAATGATGTAAAACAATTAACAGAAGCAGTG 

3200 , 



3300 
LePro 

ATACAAAAAGAAACATGGGAGGCATGGTGGACAGAATATTGGCAAGCCACCTGGATCCCT 



IleGlnLysGluThrTrpGluAlaTrpTrpThrGluTyrTrpGlnAlaThrTrpIlePro 

— - — — - • - - - -^iji^ij — - ~- ----- - - 



GluTrpGluPheValAsnThrProProLeuValLysLeuTrpTyrGlnLeuGluThrGlu 
. . _ TAAAAC ~ " ~" — 



GAATGGGAGTTTGTCAATACTCCTCCCCTAGTAAAACTATGGTACCAGTTAGAAACAGAA 

3400 

ProIleValGlyAlaGluThrPheTyrValAspGlyAlaAlaAsnArgGluThrLysLys 
CCCATAGTAGGAGCAGAAACTTTCTATGTAGATGGGGCAGCTAATAGAGAAACTAAAAAG 

GlyLysAlaGlyTyrValThrAspArgGlyArgGlnLysValValSerLeuThrGluThr 
GGAAAAGCAGGATATGTTACTGACAGAGGAAGACAAAAGGTTGTCTCCTTAACTGAAACA 

3500 

ThrAsnGlnEysThrGluL^uGlnAlall^HisLeuAlaLeuGlnAspS^rGlySerGlii 
ACAAATCAGAAGACTGAATTACAAGCAATCCACTTAGCTTTACAGGATTCAGGATCAGAA 

3600 

Val Asnl 1 eValThr AspSerGlnTyr AlaLeuGlyl le 1 1 eGlnAlaGlnPr oAspLys 
GTAAACATAGTAACAGACTCACAGTATGCATTAGGGATTATTCAAGCACAACCAGATAAA 

SerGluSerGluIleValAsnGlnllelleGluGlnLeuileGlnLysAspLysValTyr 
AGTGAATCAGAGATTGTTAATCAAATAATAGAGCAATTAATACAGAAGGACAAGGTCTAC 

3700 



SerSerGlylleArgLysValLeuPheLeuAspGlylleAspLysAlaGlnGluGluHis 
AGCAGTGGAATCAGAAAGGTACTATTTTTAGATGGGATAGATAAGGCTCAAGAAGAACAT 

3800 .... 
GluLysTyrHisSerAsnTrpArgAlaMetAlaSerAspPheAsnLeuProProIleVal 
GAAAAATATCACAGCAATTGGAGAGCAATGGCTAGTGACTTTAATCTACCACCTATAGTA 

3900 

AlaLysGluIleValAlaSerCysAspLysCysGlnLeuLysGlyGluAlaMetHisGly 
GCGAAGGAAATAGTAGCCAGCTGTGATAAATGTCAACTAAAAGGGGAAGCCATGCATGGA 

GlnValAspCysSerProGiylleTrpGlnLeuAspCysThrHisLeuGiuGlyLysIle 
CAAGTAGACTGTAGTCCAGGGATATGGCAATTAGATTGCACACATCTAGAAGGAAAAATA 

4000 

IlelleValAlaValHisValAlaSerGlyTyrlleGluAlaGluVallleProAlaGlu 
ATCATAGTAGCAGTCCATGTAGCCAGTGGATATATAGAAGCAGAAGTTATCCCAGCAGAA 



4100 FIG.7D 



ValHisThrAspAsnGlySerAsnPheThrSerAlaAlaValLysAlaAlaCysTrpTrp 
GTACACACAGACAATGGCAGCAATTTCACCAGTGCTGCAGTTAAAGCAGCCTGTTGGTGG 

4200 

AlaAsnlleLysGlnGluPheGlylleProTyrAsnProGlnSerGlnGlyValValGlu 
GCAAATATCAAACAGGAATTTGGAATTCCCTACAACCCCCAAAGTCAAGGAGTAGTGGAA 



4300 . 

LysGlyGlylleGly 
AAAGGGGGGATTGGG 



4400 



LeuGlnLysGlnlleThrLysIleGlnAsnPheArgValTyrTyrArgAspAsnArgAsp 
TTACAAAAACAAATTACAAAAATTCAAAATTTTCGGGTTTATTACAGGGAC ~ " " 



AACAGAGAC 
4500 



AspAsnSerAspIleLysValValProArgArgLysAlaLysIlelleArgAspTprGly 

MetGlu 

GACMTAGTGATATAAAGGTAGTACCAAGAAGAAAAGCAAAAATCATTAGGGATIIATGGA. 

4600 POL- 




LeArgThrTrpHis 

AMCAGATGGCAGGTGATGATTGTGTGGCAGGTGGACAGGATGAGGATlTAGAACATGGCA 

i i i i i i 

SerLeuValLysHisHisMetTyrValSerLysLysAlaLysAsnTrpPheTyrArgHis 
CAGTTTAGTAAAACATCATATGTATGTCTCAAAGAAAGCTAAAAATTGGTTTTATAGACA 
. 4700 ■ 

HisTyrGluSerArgHisProLysValSerSerGluValHisIleProLeuGlyAspAla 
TCACTATGAAAGCAGGCATCCAAAAGTAAGTTCAGAAGTACACATCCCACTAGGGGATGC 

4800 

ArgLeuValValArgThrTyrTrpGlyLeuGlnThrGlyGluLysAspTrpHisLeuGly 
TAGATTAGTAGTAAGAACATATTGGGGTCTGCAAACAGGAGAAAAAGACTGGCACTTGGG 



4900 

AlaAspGlnLeuIleHisLeuTyrTyrPheAspCysPheSerGluSerAlalleArgGln 
AGCAGACCAACTGATTCATCTGTACTATTTTGATTGTTTTTCAGAATCTGCCATAAGACA 

AlalleLeuGlyHisIleValSerProArgCysAspTyrGlnAlaGlyHisAsnLysVal 
AGCCATATTAGGACATATAGTTAGTCCTAGGTGTGATTATCAAGCAGGACATAACAAGGT 

5000 , . , 

GlySerLeuGlnTyrLeuAlaLeuThrAlaLeuIleAlaProLysLysThrArgProPro 
AGGATCTTTACAGTATTTGGCACTAACAGCATTAATAGCACCAAAAAAGACAAGGCCACC 

. |— R , . 5100 

MetGluGlnAlaProAlaAspGlnGly 
LeuProSerValArgLysLeuThrGluAspArgTrpAsnLysProGlnGlnThrLysGly. 
TTTGCCTAGTGTTAGGAAGCTAACAGAAGATAGiATGGAACAAGCCCCAGCAGACCAAGGG 

FIG.7E 




5200 

AlaValArgHisPheProArglleTrpLeuHisSerLeuGlyGlnHisIleTyrGluThr 
GCTGTCAGACACTTTCCTAGGATATGGCTCCATAGTTTAGGACMCATATCTATGAAACT 

i i i i i i 

TyrGlyAspThrTrpGluGlyValGluAlallelleArgSerLeuGlnGlnLeuLeuPhe 
TATGGGGATACCTGGGAAGGAGTTGAAGCTATAATAAGAAGTCTGCAACAACTGCTGTTT 

5300 . ■ - . 

IleHisPheArglleGlyCysGlnHisSerArglleGlylleThrArgGlnArgArgAla 
ATTCATTTCAGAATTGGGTGTCAACATAGCAGAATAGGCATTACTCGACAGAGAAGAGCA 

R— i— | . , , 5400 

DAsnLeuGluProTrpAsnHisProGlySerGlnProArg 



ArgAsnGlySerSerArgSer 
MetAspProValAspPr 
AGAMTGGATCCAGTAGATCCITAACTTAGAGCCCTG5AACCATCCAGGGAGTCAGCCTAG 

ThrProCysAsnLysCysTyrCysLysLysCysCysTyrHisCysGlnMetCysPhelie 
GACGCCTTGTAATAAGTGTTATTGTAAAAAGTGCTGCTATCATTGCCAAATGTGCTTCAT 

5500 

ThrLysGlyLeuGlylleSerTyrGlyArgLysLysArgArgGlnArgArgArgProPro 
AACGAAAGGCTTAGGCATCTCCTATGGCAGGAAGAAGCGGAGACAGCGACGAAGACCTCC 

GlnGlyAsnGlnAlaHisGlnAspProLeuProGluGlrJ 
TCAGGGCAATCAGGCTCATCAAGATCCTCTACCAGAGCAQTAAGTAGTATATGTAATACA 

5600 

ACCTTTAGTGATATTAGCAATAGTAGCATTAGTAGTAACGCTAATAATAGCAATAGTTGT 

5700 

GTGGACCATAGTATTTATAGAAATTAGGAAAATAAGAAGACAAAGGAAAATAGACAGGTT 

r^ENV 

MetArgValArgGluIleGlnArg 
GATTGATAGAATAAGAGAAAGAGCAGAAGATAGTGGCAATGAGAGTGAGGGAGATACAGA 

5900 . • 

AsnTyrGlnAsnTrpTrpArgTrpGlyMetMetLeuLeuGlyMetLeuMetThrCysSer 
^ATTATCAAAACTGGTGGAGATGGGGCATGATGCTCCTTGGGATGTTGATGACCTG'] 



GGAATTATCAAAACTGGTGGAGATGGGGCATGATGCTCCTTGGGATGTTGATGACCTGTA 

IleAlaGluAspLeuTrpValThrVal^rTyrGlyValProValTrpLysGluAlaThr 
GTATTGCAGAAGATTTGTGGGTTACAGTTTATTATGGGGTACCTGTGTGGAAAGAAGCAA 

5900 .... 
ThrThrLeuPheCysAlaSerAspAlaLysSerTyrGluThrGluValHisAsnlleTrp 
CCACTACTCTATTTTGTGCATCAGATGCTAAATCATATGAAACAGAAGTACATAACATCT 

. . . . 6000 

AlaThrHisAlaCysValProThrAspProAsnProGlnGluIleGluLeuGluAsnVal 
GGGCTACACATGCCTGTGTACCCACGGACCCCAACCCACAAGAAATAGAACTGGAAAATG 

i i i i ii 

ThrGluGlyPheAsnMetTrpLysAsnAsnMetValGluGlnMetHisGluAspIlelle 
TCACAGAAGGGTTTAACATGTGGAAAAATAACATGGTGGAGCAGATGCATGAGGATATAA 

6100 



FIG. 7F 



+ 



SerLeuTrpAspGlnSerLeuLysProCysValLysLeuThrProLeuCysValThrLeu 
TCAGTTTATGGGATCAAAGCCTAAMCCATGTGTAAAGCTAACCCCACTCTGTGTCACTT 

i i i i ii 

AsnCysThrAsnValAsnGlyThrAlaValAsnGlyThrAsnAlaGlySerAsnArgThr 
TAAACTGCACTAATGTGAATGGGACTGCTGTGAATGGGACTAATGCTGGGAGTAATAGGA 

6200 

AsnAlaGluLeuLysMetGluIleGlyGluValLysAsnCysSerPheAsnlleThrPro 
CTAATGCAGAATTGAAAATGGAAATTGGAGAAGTGAAAAACTGGTCTTTCAATATAACCC 
, , , , , 6300 

ValGlySerAspLysArgGlnGluTyrAlaThrPheTyrAsnLeuAspLeuValGlnlle 
CAGTAGGAAGTGATAAAAGGCAAGAATATGCAACTTTTTATAACCTTGATCTAGTACAAA 

i i i i i i 

AspAspSerAspAsnSerSerTyrArgLeuIleAsnCysAsnThrSerVallleThrGln 
TAGATGATAGTGATAATAGTAGTTATAGGCTAATAAATTGTAATACCTCAGTAATTACAC 

6400 

AlaCysProLysValThrPheAspProIleProIleHisTyrCysAlaProAlaGlyPhe 
AGGCTTGTCCAAAGGTAACCTTTGATCCAATTCCCATACATTATTGTGCCCCAGCTGGTT 



6500 . 

ThrValGlnCysThrHisGlylleLysProValValSerThrGlnLeuLeuLeuAsnGly 
GTACAGTACAATGTACACATGGAATTAAGCCAGTGGTGTCAACTCAACTGCTGTTAAATG 

6600 

SerLeuAlaGluGluGluIleMetlleArgSerGluAsnLeuThrAspAsnThrLysAsn 
GCAGTCTAGCAGAAGAAGAGATAATGATTAGATCTGAAAATCTCACAGACAATACTAAAA 

i i i i ii 

IlelleValGlnLeuAsnGluThrValThrlleAsnCysThrArgProGlyAsnAsnThr 
ACATAATAGTACAGCTTAATGAAACTGTAACAATTAATTGTACAAGGCCTGGAAACAATA 

6700 

ArgArgGlylleHisPheGlyProGlyGlnAlaLeuTyrThrThrGlylleValGlyAsp 
CAAGAAGAGGGATACATTTCGGCCCAGGGCAAGCACTCTATACAACAGGGATAGTAGGAG 

i i iii i 

IleArgArgAlaTyrCysThrlleAsnGluThrGluTrpAspLysThrLeuGlnGlnVal 
ATATAAGAAGAGCATATTGTACTATTAATGAAACAGAATGGGATAAAACTTTACAACAGG 

6800 .... 



TAGCTGT7 

..... 6900 
GlyAspProGluIleThrThrHisSerPheAsnCysArgGlyGluPhePheTyrCysAsn 



LyAsp 

GAGGGGACCCAGAAATTACAACACACAGTTTTAATTGTAGAGGSGAATTTTTCTACTGTA 

I I I III 

ThrSerLysLeuPheAsnSerThrTrpGlnAsnAsnGlyAlaArgLeuSerAsnSerThr 
ATACATCAAAACTGTTTAATAGTACATGGCAGAATAATGGTGCAAGACTAAGTAATAGCA 

7000 

GluSerThrGlySerlleThrLeuProCysArglleLysGlnllelleAsnMetTrpGln 
CAGAGTCAACTGGTAGTATCACACTCCCATGCAGAATAAAACAAATTATAAATATGTGGC 

i.i t i ■ i 

LysThrGlyLysAlaMetTyrAlaProProIleAlaGlyVallleAsnCysLeuSerAsn 
AGAAAACAGGAAAAGCTATGTATGCCCCTCCCATCGCAGGAGTCATCAACTGTTTATCAA 

7100 .... 

IleThrGlyLeuIleLeuThrArgAspGlyGlyAsnSerSerAspAsnSerAspAsnGlu 
ATATTACAGGGCTGATATTAACAAGAGATGGTGGAAATAGTAGTGACAATAGTGACAATG 

FIG. 7G 



ThrLeuArgProGlyGlyGlyAspMetArgAspAsnTrpIleSerGluLeuTyrLysTyr 
AGACCTTAAGACCTGGAGGAGGAGATATGAGGGACAATTGGATAAGTGAATTATATAAAT 

t i i t i i 

LysValValArglleGluProLeuGlyValAlaProThrLysAlaLysArgArgValVal 
ATAAAGTAGTAAGAATTGAACCCCTAGGAGTAGCACCCACCAAGGCAAAGAGAAGAGTGG 

7300 i ' • 

GluArgGluLysArgAlalleGlyLeuGlyAlaMetPheLeuGlyPheLeuGlyAlaAla 
TGGAAAGAGAAAAAAGAGCAATAGGACTAGGAGCCATGTTCCTTGGGTTCTTGGGAGCAG 

GlySerThrMetGlyAlaAlaSerLeuThrLeuThrValGlnAlaArgGlnLeuLeuSer 
CAGGAAGCACGATGGGCGCAGCGTCACTAACGCTGACGGTACAGGCCAGACAGTTACTGT 

7400 . . 

GlylleValGlnGlnGlnAsnAsnLeuLeuArgAlalleGluAlaGlnGlnHisLeuLeu 
CTGGTATAGTGCAACAGCAAAACAATTTGCTGAGGGCTATAGAGGCGCAACAGCATCTGT 

. , . 7500 

GlnLeuThrValTrpGlylleLysGlnLeuGlnAlaArgValLeuAlaValGluArgTyr 
TGCAACTCACGGTCTGGGGCATTAAACAGCTCCAGGCAAGAGTCCTGGCTGTGGAAAGAT 

i i i i ■ i 

LeuGlnAspGlnArgLeuLeuGlyMetTrpGlyCysSerGlyLysHisIleCysThrThr 
ACCTACAGGATCAACGGCTCCTAGGAATGTGGGGTTGCTCTGGAAAACACATTTGCACCA 

7600 

PheVal Pr oTrpAsnSer SerTrpSerAsnArgSerLeuAspAspI leTrpAsnAsnMe t 
CATTTGTGCCTTGGAACTCTAGTTGGAGTAATAGATCTCTAGATGACATTTGGAATAATA 
i i i i ■ • 

ThrTrpMetGlnTrpGluLysGluIleSerAsnTyrThrGlyllelleTyrAsnLeuIle 
TGACCTGGATGCAGTGGGAAAAAGAAATTAGCAATTACACAGGCATAATATACAACTTAA 

7700 .... 
GluGluSerGlnlleGlnGlnGluLysAsnGluLysGluLeuLeuGluLeuAspLysTrp 
TTGAAGAATCGCAAATCCAGCAAGAAAAGAATGAAAAGGAATTATTGGAATTGGACAAGT 

7800 

AlaSerLeuTrpAsnTrpPheSerlleSerLysTrpLeuTrpTyrlleArgllePhelle 
GGGCAAGTTTGTGGAATTGGTTTAGCATATCAAAATGGCTGTGGTATATAAGAATATTCA 

i i i i i ' 

IleValValGlyGlyLeuIleGlyLeuArgllellePheAlaValLeuSerLeuValAsn 
TAATAGTAGTAGGAGGCTTAATAGGTTTAAGAATAATTTTTGCTGTGCTTTCTTTAGTM 

7900 

ArgValArgGlnGlyTyrSerProLeuSerLeuGlnThrLeuLeuProThrProArgGly 
ATAGAGTTAGGCAGGGATACTCACCTCTGTCGTTGCAGACCCTCCTCCCAACACCGAGGG 

i i i i i ■ 

ProProAspArgProGluGlylleGluGluGluGlyGlyGluGlnGlyArgGlyArgSer 
GACCACCCGACAGGCCCGAAGGAATAGAAGAAGAAGGTGGAGAGCAAGGCAGAGGCAGAT 

8000 

IleArgLeuValAsnGlyPheSerAlaLeuIleTrpAspAspLeuArgAsnLeuCysLeu 
CAATTCGATTGGTGAACGGATTCTCAGCACTTATCTGGGACGACCTGAGGAACCTGTGCC 

8100 

PheSerTyrHisArgLeuArgAspLeuLeuLeuIleAlaThrArglleValGluLeuLeu 
TCTTCAGTTACCACCGCTTGAGAGACTTACTCTTAATTGCAACGAGGATTGTGGAACTTC 

i i i i i i 

GlyArgArgGlyTrpGluAlaLeuLysTyrLeuTrpAsnLeuLeuGlnTyrTrpGlyGln 
TGGGACGCAGGGGGTGGGAAGCCCTCAAATATCTGTGGAATCTCCTGCAATATTGGGGTC 

8200 



FIG. 7H 



GluLeuLysAsnSerAlalleSerLeuLeuAsnThrThrAlalleAlaValAlaGluCys 
AGGAACTGAAGAATAGTGCTATTAGCTTGCTTAATACCACAGCAATAGCAGTAGCTGAAT 

Thr AspArgVal I leGluI leGlyGlnArgPheGlyArgAlal 1 eLeuHi s 1 1 ePr oArg . 
GCACAGATAGGGTTATAGAAATAGGACAAAGATTTGGTAGAGCTATTCTCCACATACCTA 

8300 • F 

EW- — 1 MetGlyGlyLysTrpSerLys 
Argil eArgGlnGlyPheGluArgAlaLeuLed 
GMGMTTAGACAGGGCTTCGAMGGGCTTTGCTAfTAAdATGGGTGGCAAGTGGTCAAAA 

8400 

SerSerlleValGlyTrpProLysIleArgGluArglleArgArgThrProProThrGlu 
AGTAGCATAGTAGGATGGCCTAAGATTAGGGAAAGAATAAGACGAACTCCCCCAACAGAA. 

ThrGlyValGlyAlaValSerGlnAspAlaValSerGlnAspLeuAspLysCysGlyAla 
ACAGGAGTAGGAGCAGTATCTCAAGATGCAGTATCTCAAGATTTAGATAAATGTGGAGCA 

8500 

AlaAlaSerSerSerProAlaAlaAsnAsnAlaSerCysGluProProGluGluGluGlu 
GCCGCAAGCAGCAGTCCAGCAGCTAATAATGCTAGTTGTGAACCACCAGAAGAAGAGGAG 

GluValGlyPheProValArgProGlnVaiProLeuArgProMetThrTyrLysGlyAla 
GAGGTAGGCTTTCCAGTCCGTCCTCAGGTACCTTTAAGACCAATGACTTATAAAGGAGCT 

8600 , (— U3, 

PheAspLeuSerHisPheLeuLysGluLysGlyGlyLeuAspGlyLeuValTrpSerPro 
TTTGATCTCAGCCACTTTTTAAAAGAAAAGGGGGQACTGGATGGGTTAGTTTGGTCCCCA 

8700 

LysArgGlnGluIleLeuAspLeuTrpValTyrHisThrGlnGlyTyrPheProAspTrp 
AAAAGACAAGAAATCCTTGATCTGTGGGTCTACCACACACAAGGCTACTTCCCTGATTGG 

GlnAsnTyrThrProGlyProGlylleArgPheProLeuThrPheGlyTrpCysPheLys 
CAGAATTACACACCAGGGCCAGGGATTAGATTCCCACTGACCTTCGGATGGTGCTTTAAG 

.8800 

LeuValProMetSerProGluGluValGluGluAlaAsnGluGlyGluAsnAsnCysLeu 
TTAGTACCAATGAGTCCAGAGGAAGTAGAGGAGGCCAATGAAGGAGAGAACAACTGTCTG 



8900 , 

PheAspSerSerLeuAlaLeuArgHisArgAlaArgGluGlnHisProGluTyrTyrLys 
TTTGACAGCAGCCTAGCACTAAGACACAGAGCCAGAGAACAACATCCGGAGTACTACAAA 
F*— i . . . . 9000 

AspCysl 

GACTGCITGACACAGAAGTTGCTGACAGGGGACTTTCCGCTGGGGACTTTCCAGGGGAGGC 



GTAACTTGGGCGGGACCGGGGAGTGGCTAACCCTCAGATGCTGCATATAAGCAGCTGCTT 

•U3-T-R ■ • 9100 

TTCGCCTGTACTGpGTCTCTCTTGTTAGACCAGGTCGAGCCCGGGAGCTCTCTGGCTAGC 

AAGGAACCCAGTGCTTAAGCCTCAATAAAGCTTGCCTTGAGTGCCTCAA ' 

9200 



FIG. 71 
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Detailed Office Action 

Status of the Claims 

Acknowledgement is hereby made of receipt and entry of the 
amendment filed 09 December, 2004. Claims 27, 28/31, 32, and 33 
were amended and new claims 4 0-50 introduced. Newly submitted 
claims 40-50 are directed to an invention that is independent or 
distinct from the invention originally claimed. The claims are 
directed toward invariant peptide sequences whereas the originally 
presented claims are directed toward peptidic variants. Each of 
the identified groups contains structurally and functionally 
different polypeptides. Since applicant has received an action on 
the merits for the originally presented invention, this invention 
has been constructively elected by original presentation for 
prosecution on the merits. Accordingly, claims 40-50 are withdrawn 
from further consideration as being directed towards a nonelected 
invention (refer to 37 C.F.R. § 1.142(b) and M.P.E.P. § 821.03). 
Claims 23-28 and 31-39 are currently under examination. 

35 U.S. C. § 112, Second Paragraph 

The previous rejection of claims 27, 28, 32, and 33 under 35 
U.S.C. § 112, second paragraph, as being indefinite for failing to 
particularly point out and distinctly claim the subject matter 
which applicant regards as the invention, is hereby withdrawn in 
response to applicants' amendment. 

35 U.S.C. § 112, First Paragraph 

The following is a quotation of the first paragraph of 35 U.S.C. 
§ 112.: 

The specification shall contain a written description of the 

invention, and of the manner and process of making and using it, 

in such full, clear, concise, and exact terms as to enable any 
person skilled in the art to which it pertains, or with which it 
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is most nearly connected, to make and use the same and shall set 
forth the best mode contemplated by the inventor of carrying out 
his invention. 

Claims 23-28 and 31-39 stand rejected under 35 U.S. C. § 112, 
first paragraph, as containing subject matter which was not 
described in the specification in such a way as to reasonably 
convey to one skilled in the relevant art that the inventor (s) , at 
the time the application was filed, had possession of the claimed 
invention. In re Rasmussen, 650 F.2d 1212, 211 U.S.P.Q. 323 
(C.C.P.A. 1981). In re Wertheim, 541 F.2d 257, 191 U.S.P.Q. 90 
(C.C.P.A. 1976) . Claims 23-26 and 31 are directed toward 
immunogenic HIV-1 Env polypeptides of 5-150 aa comprising at least 
one amino acid substitution at a specified position (e.g., aa 8, 9, 
90, etc.). Claims 27, 28, 32, and 33 are directed toward 
methodologies that require these peptides. Claims 34-39 are also 
directed toward immunogenic HIV-1 Env polypeptides comprising one 
of the aforementioned substitutions and include additional 
limitations pertaining to the overall peptide length (e.g., 21 aa, 
43 aa, 79 aa, etc . ) . 

As previously set forth, in order to satisfy the written 
description requirement, a patent specification must describe the 
claimed invention in sufficient detail that one skilled in the art 
can reasonably conclude that the inventor had possession of the 
claimed invention. See, e.g., Vas-Cath, Inc., v. Mahurkar, 935 
F.2d at 1563, 19 U.S.P.Q. 2d at 1116. The issue raised in this 
application is whether the original application provides adequate 
support for the broadly claimed genus of immunogenic polypeptide 
fragments comprising HIV-Imal epitopes of 5-150 amino acid residues 
wherein at least one amino acid residue is substituted at one of 
the specified positions. An applicant shows possession of the 
claimed invention by describing the claimed invention with all of 
its limitations using such descriptive means as words, structures, 
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figures, diagrams, and formulas that fully set forth the claimed 
invention. Lockwood v. American Airlines, Inc., 107 F.3d 1565, 
1572, 41 U.S.P.Q.2d 1961, 1966 (Fed. Cir, 1997). The claimed 
invention as a whole may not be adequately described where an 
invention is described solely in terms of a method of its making 
coupled with its function and there is no described or art- 
recognized correlation or relationship between the structure of the 
invention and its function. A biomolecule sequence (e.g., epitope) 
described only by functional characteristic, without any known or 
disclosed correlation between that function and the structure of 
the sequence, normally is not a sufficient identifying 
characteristic for written description purposes, even when 
accompanied by a method of obtaining the biomolecule of interest. 
In re Bell, 991 F.2d 781, 26 U.S.P.Q.2d 1529 (Fed. Cir. 1993). In 
re Deuel, 51 F.3d 1552, 34 U.S. P. Q. 2d 1210 (Fed. Cir. 1995). A 
lack of adequate written description issue also arises if the 
knowledge and level of skill in the art would not permit one 
skilled in the art to immediately envisage the product claimed from 
the disclosed process. See, e.g., Fujikawa v. Wattanasin, 93 F.3d 
1559, 1571, 39 U.S.P.Q.2d 1895, 1905 (Fed. Cir. 1995). The court 
noted in this decision that a laundry list disclosure of every 
possible moiety does not constitute a written description of every 
species in a genus because it would not reasonably lead those 
skilled in the art to any particular species. 

An applicant may show possession of an invention by disclosure 
of drawings or structural chemical formulas that are sufficiently 
detailed to show that applicant was in possession of the claimed 
invention as a whole. An applicant may also show that an invention . 
is complete by disclosure of sufficiently detailed, relevant 
identifying characteristics which provide evidence that applicant 
was in possession of the claimed invention, i.e., complete or 
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partial structure, other physical and/or chemical properties, 

functional characteristics when coupled with a known or disclosed 

i 

correlation between function and structure, or some combination of 
such characteristics. For some biomolecules, examples of 
identifying characteristics include a nucleotide or amino acid 
sequence, chemical structure, binding affinity, binding 
specificity, and molecular weight. . The written description 
requirement may be satisfied through disclosure of function and 
minimal structure when there is a well-established correlation 
between structure and function. Without such a correlation, the 
capability to recognize or understand the structure from the mere 
recitation of function and minimal structure is highly unlikely. 
In the latter case, disclosure of function alone is little more 
than a wish for possession; it does not satisfy the written 
description requirement. Regents of the University of California 
v. Eli Lilly, 119 F.3d 1559, 1566, 43 U.S.P.Q.2d 1398, 1404, 1406 

(Fed. Cir. 1997), cert, denied, 523 U.S. 1089 (1998). In re 
Wilder, 736 F.2d 1516, 1521, 222 U.S.P.Q. 369, 372-3 (Fed. Cir. 
1984) . Factors to be considered in determining whether there is 
sufficient evidence of possession include the level of skill and 
knowledge in the art, partial structure, physical and/or chemical 
properties, functional characteristics alone or coupled with a 
known or disclosed correlation between structure and function, and 
the method of making the claimed invention. 

. As previously noted, perusal of the disclosure reveals the 
cloning and characterization of a novel human immunodeficiency 
virus type 1 originally designated lymphadenopathy associated virus 

(LAV) MAL, or LAVmal. A proviral molecular clone was obtained and 
complete nucleotide isolate of this sequence ascertained (see Figs. 
7A-7I) . The deduced amino acid sequences of the various viral 
structural and non- structural genes were also set forth in Figure 
7. Specific envelope polypeptide fragments were set forth on p. 36 
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of the specification (e.g., 1-530, 34-530, 531-877, 680-700, 37- 
130, 211-289, 488-530, and 490-620) . It should be noted that these 
designations actually referenced LAV B ru amino acid sequences, not 
specific LAVmal polypeptides. Thus, the skilled artisan might 
conclude that applicants contemplated making and using these 
specific envelope polypeptides. However, the skilled artisan would 
not reasonably conclude that applicants were in possession of the 
claimed invention. 

First, the disclosure fails to identify specific HIV- 1mm, 
immunogenic fragments of the claimed lengths and substitutions. 
The specification only sets forth the deduced amino acid sequences 
of the full-length non- structural and structural genes as set forth 
in Figure 7 and the specific Env fragments set forth on p. 36. 
Figure 3 also fails to identify immunogenic MAL peptides. This 
figure simply provides an amino acid comparison between MAL, BRU, 
ARV-2, and ELI to assess their genetic relatedness. The figure 
does not identify or lead the skilled artisan to any particular 
immunogenic fragment, particularly one carrying amino acid 
substitutions. Second, the disclosure fails to perform any type of 
comparison wherein specific immunogenic fragments from isolate MAL 
are identified and acceptable amino acid substitutions are 
performed. It is well-known in the art that subtle perturbations 
in an amino acid sequence can profoundly affect both the 
immunogenic and . antigenic properties of any given polypeptide. 
Thus, the skilled artisan can only hazard a guess as to which 
substituted MAL fragments will remain immunogenic. Third, the 
disclosure fails to provide adequate support for MAL-specific 
polypeptides the recited lengths (e.g., 21, 43, 79, 94, and 131 
aa) . The only numerical limitations set forth in the disclosure 
recite immunogenic polypeptides or fusion proteins which may 
contain- between 5 and 150 amino acids (see p. 28) . Thus, support 
does not exist for the current size limitations. Nothing in the 
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disclosure directs the skilled artisan toward any particular MAL 
immunogenic fragment or any fragment carrying amino acid 
substitutions. The disclosure fails to identify those molecular 
determinants modulating the immunogenicity of any given polypeptide 
fragment. Clearly, the claimed invention simply represents an 
attempt by applicants to capture subject matter which was neither 
described nor contemplated at the time of filing. Accordingly, the 
skilled artisan would reasonably conclude that applicants were not 
in possession of the claimed invention at the time of filing. 

Response to Arguments 

Applicants again traverse and submit that support for the 
claimed substitutions can be found in Figure 3. Applicants further 
reference pages 16 and 23 in support . The examiner still does not 
concur with this assessment. Figure 3 provides the amino acid 
sequence alignment of four different HIV-1 isolates (e.g., BRU, 
ARV-2, NIAL, and ELI). The purpose of this figure is to simply 
illustrate that while these are all HIV-1 isolates, nevertheless, 
they display considerable genetic diversity in the envelope region. 
Pages 16 and 23 also fail to provide support for the claimed 
invention. None of these pages sets forth any particular peptidic 
variants as currently claimed. 

Applicants further assert that consideration of Enzo Biochem, 
Inc. v. Gen-ProJbe, Jnc. , 323 F.3d 956 (Fed. Cir. 2002), is 
warranted. Applicants argue that the fact pattern is similar in 
both situations. It was noted that sufficient structural and 
functional information was provided in the specification. The 
examiner does not concur with this assessment. The problem with 
the current claims is that the skilled artisan would reasonably 
conclude that applicants were not in possession of the claimed 
invention. Nothing in the disclosure leads the skilled artisan to 
a polypeptide with one or more of the recited amino acid 
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substitutions. A more relevant piece of case law might be 
University of Rochester v. G.D. Searle & Co., 69 U.S.P.Q.2d 1886 
(C.A.F.C. 2004) . Here the court in discussing In re Ruschig, 379 
F.2d 990 [154 U.S.P.Q. 118] (C.C.P.A. 1967), noted that although a 
particular compound fell within the scope of the originally filed 
claim, nevertheless, "it was never named or otherwise exemplified 
in the appellants original patent application. " These facts are 
similar to those in the instant application wherein the applicants 
have failed to clearly identify or set forth any of the specific 
peptidic variants currently being claimed. As the Supreme court 
has previously cautioned, 14 a patent is not a hunting license. It 
is not a reward for the search, but compensation for its successful 
conclusion." Brenner v. Manson, 383 U.S. 519, 536 [148 U.S.P.Q. 
689] (1966). Accordingly, the rejection is proper and hereby 
maintained. 

Finality of Office Action 
THIS ACTION IS MADE FINAL . Applicant is reminded of the 
extension of time policy as set forth in 37 C.F.R. § 1.136(a). A 
SHORTENED STATUTORY PERIOD FOR RESPONSE TO THIS FINAL ACTION IS SET 
TO EXPIRE THREE MONTHS FROM THE DATE OF THIS ACTION. IN THE EVENT 
A FIRST RESPONSE IS FILED WITHIN TWO MONTHS OF THE MAILING DATE OF 
THIS FINAL ACTION AND THE ADVISORY ACTION IS NOT MAILED UNTIL AFTER 
THE END OF THE THREE -MONTH SHORTENED STATUTORY PERIOD, THEN THE 
SHORTENED STATUTORY PERIOD WILL EXPIRE ON THE DATE THE ADVISORY 
ACTION IS MAILED, AND ANY EXTENSION FEE PURSUANT TO 37 C.F.R. 
§ 1.136(a) WILL BE CALCULATED FROM THE MAILING DATE OF THE ADVISORY 
ACTION. IN NO EVENT WILL THE STATUTORY PERIOD FOR RESPONSE EXPIRE 
LATER THAN SIX MONTHS FROM THE DATE OF THIS FINAL ACTION. 
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Correspondence 



Any inquiry concerning this communication should be directed to 
Jeffrey S. Parkin, Ph.D./ whose telephone number is (571) 27^-0908. 
The examiner can normally be reached Monday through Thursday from 
10:30 AM to 9:00 PM. A message may be left on the examiner's voice 
mail service. If attempts to reach the examiner are unsuccessful, 
the examiner's supervisor, James C. Housel, can be reached at (571) 
272-0902. Direct general status inquiries to the Technology Center 
1600 receptionist at (571) 272-1600. Formal communications may be 
submitted through the official facsimile number which is (703) 872- 
9306. Hand-carried formal communications should be directed toward 
the customer window located in Crystal Plaza Two, 2 011 South Clark 
Place, Arlington, VA. Applicants are directed toward the O.G. 
Notice for further guidance. 1280 O.G. 681. Informal communications 
may be submitted to the Examiner's RightFAX account at (571) 273- 
0908. 

Information regarding the status of an application may be 
obtained from the Patent Application Information Retrieval (PAIR) 
system. Status information for published applications may be 
obtained from either Private PAIR or Public PAIR. Status 
information for unpublished applications is available through 
Private PAIR only. For more information about the PAIR system, see 
http://pair-direct.uspto.gov. Should you have questions on access 
to the Private PAIR system, contact the Electronic Business Center 
(EBC) at .866-217-9197 (toll-free). 




JefJ^y S. Parkin, Ph.D. 
Primary Examiner 
Art Unit 1648 



15 April, 2005 
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